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G7: Balance security and collaboration 


ncreased government scrutiny of cross-border univer- 
sity research relationships, tightened export controls 
on technologies, and strengthened national regimes 
regulating technology-related foreign direct invest- 
ment are now priorities for most democracies. These 
policy changes are motivated by the common goal of 
shoring up economic and national security. But the 
approaches are neither uniform nor harmonized, even 
among the relatively homogeneous G7 nations, under- 
mining cross-border research and development (R&D) 
collaboration. When the leaders of G7 meet in late June in 
Schloss Elmau, Germany, they should make it a priority to 
coordinate controls on knowledge flows and technology. 
They need to act together to demonstrate how democra- 
cies can counter illicit activities 
for acquiring technologies. 

The issue of research security 
bubbled up on university cam- 
puses in the US almost 5 years 
ago as questions about technol- 
ogy exports to, and acquisitions 
of US firms by, China raised 
concerns about the economic, 
military, and intelligence vulner- 
abilities of G7 nations. Then in 
February 2022, Russia invaded 
Ukraine. Many of the world’s 
liberal democracies banded to- 
gether—in record time and a 
highly coordinated fashion—to impose far-reaching ex- 
port controls to prevent advanced technology products 
from reaching Russia. Democracies in general, and the G7 
in particular, have awoken to the fact that they have the 
means and powerful new motivations to more carefully 
control the diffusion of dual-use knowledge and advanced 
technologies to adversaries. 

The result is looming changes—in law and enforce- 
ment—of national policies in advanced democracies. The 
new approaches fall to different governmental entities, 
depending on whether the policy is the granting of export 
licenses; agency clearance of inbound—and likely soon 
even outbound—foreign direct investment; or the fund- 
ing of university research. The result—leading democratic 
states rapidly inventing or strengthening technology- 
protection polices with little multilateral consultation—is 
predictable: regulatory confusion for both researchers 
and companies engaged in cross-border activity, and a re- 
duction in international flows of scientific and engineer- 
ing knowledge and personnel. 

There are huge and long-term costs to innovation and 
economic growth of allies taking distinct approaches to 


“il...needs.... 
a process to separate 


high-risk from low-risk 
commercial technology 
exchanges...” 


export controls, placing limits on university-based open 
research collaboration between domestic and foreign 
scientists, or imposing constraints on foreign direct in- 
vestment. All three of these national security policies can 
substantially affect beneficial cross-border research col- 
laboration and commercial exchange in advanced tech- 
nologies. But they are rarely considered together. They 
should be. Ensuring such integration is where the G7 
must lead. Indeed, the G7 needs to act rapidly. 

Because of the decentralized nature of the changes re- 
quired, it is hard to know whether this imperative will 
even be recognized as an emerging problem when the 
G7 leaders meet later this month. Whereas balancing re- 
search security and openness in research was identified 
as a key topic in the “Research 
Compact” unveiled by the G7 
at last year’s summit in Corn- 
wall, UK, there does not appear 
to be any focus planned on the 
impact of tightening the down- 
stream—and arguably far more 
important—technology export 
controls and foreign direct in- 
vestment regulations. 

The first step is for the G7 to 
agree on the principles of a new 
regime: making the implemen- 
tation of national regulations 
smarter about global knowledge 
networks, coordinating to facilitate openness among the 
G7 nations and control at the interface between those 
within the group and those outside; and ensuring harmo- 
nization that supports cross-border collaboration in pub- 
lic and private R&D and innovation within the G7. 

The G7 leaders cannot design such a regime during 
their 2022 summit, but they can agree that it needs to be 
done on a priority basis and formally launch the process 
of creating modernized, enforceable, cross-border science 
and technology agreements governing not just govern- 
ment-to-government research exchanges but also setting 
out a process to separate high-risk from low-risk commer- 
cial technology exchanges and foreign direct investments. 
And they need to be quick, as the sovereign-to-sovereign 
agreements among the G7 must be in place soon so that 
other liberal democracies can align their policies and 
practices with the G7. 

The G7 should lead democracies out of the thicket of 
conflicting, collaboration-dampening, and national ap- 
proaches to protecting the technology-dependent aspect 
of economic and national security. 

-Harry G. Broadman and Chaouki Abdallah 
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A future for Ukrainian science 


s the war in Ukraine enters its fourth month, 
Russian forces continue to destroy the nation’s 
scientific institutions and infrastructure, signal- 
ing Russia’s intent to obliterate the future for 
Ukraine. In Kharkiv, for instance, the renowned 
Institute of Physics and Technology and its newly 
built Neutron Source nuclear facility have been 
heavily damaged. Even the Plant Production Institute 
with its underground national seed bank—one of the 
world’s largest—has been bombed. At the Chernobyl nu- 
clear labs, Russian forces have looted or destroyed hun- 
dreds of computers, radiation dosimeters, and irreplace- 
able software and equipment. Although the response to 
each international science crisis is necessarily unique, the 
US National Academy of Sciences is 
once again joining with internation- 
al and regional partners to support 
beleaguered colleagues, as it did last 
year in the successful extraction and 
resettlement of Afghanistan scien- 
tists at risk from the Taliban. To that 
end, the national science academies 
of Poland, Ukraine, and the United 
States recently convened a meet- 
ing of leaders from several national 
science academies (including the 
presidents of Germany’s Leopoldina 
science academy, the Royal Danish 
Academy of Sciences and Letters, 
and the ALLEA European Federation of Academies of 
Sciences and Humanities, and leaders from the Royal So- 
ciety of the United Kingdom) to explore how the global 
science community can best help Ukraine. The resulting 
10-point action plan for the world’s research community 
aims to help meet several immediate needs and also pro- 
vide the building blocks for revitalizing Ukrainian science 
in the future. 

The plan seeks to support researchers working within 
Ukrainian institutions, as well as their teams—whether 
within Ukraine or in neighboring countries—both sci- 
entifically and financially so that these generations of 
researchers will not be lost to science or to Ukraine. 
Along with the traditional types of support provided by 
international cooperation and collaboration, funders 
are considering developing grant programs for joint 
research, particularly if conducted inside Ukraine. 
Some journal publishers have waived Ukraine’s insti- 
tutional subscription charges and author publication 
charges, and scientific societies are considering tempo- 
rarily suspending membership dues. Such steps open 
the opportunity to immediately increase connections 


“Itis...vital 
that science 


be a cornerstone 
of any postwar 
reconstruction...” 


of Ukrainian researchers to thriving international re- 
search communities. It is also vital that science be a 
cornerstone of any postwar reconstruction of Ukraine. 

There is also an urgent need for donated instruments, 
including various types of microscopes, spectrometers, 
and materials testing machines to keep these research- 
ers productive. At the same time, many female scholars 
forced to flee, most with children, need temporary re- 
search positions and financial support for themselves 
and their research teams until they can return home. 
The Polish Academy of Sciences and the US National 
Academy of Sciences have partnered to place temporar- 
ily some 220 Ukrainian researchers in Polish science 
institutions and provide financial support so that they 
can continue to make valuable con- 
tributions to research. Programs 
like this will hopefully expand 
within Poland and into other Eu- 
ropean countries such as Germany, 
the United Kingdom, and Denmark 
while allowing these scientists to 
retain their Ukrainian science affil- 
iations until they can return home. 

Lessons learned during the 
COVID-19 pandemic can also be ap- 
plied to help Ukrainian researchers 
form virtual networks with interna- 
tional colleagues, with intentional 
encouragement from institutions 
and researchers. These efforts cost little but would keep 
these scientists engaged and involved. 

Once the war is over, it is hoped that Ukraine will 
swiftly begin the monumental task of rebuilding. Na- 
tional science academies around the world should ad- 
vocate that international aid to Ukraine be directed 
to rebuilding science infrastructure alongside other 
critical needs such as transportation, energy, and health 
care. Rebuilding Ukrainian science should not concen- 
trate on replicating what was lost, but on equipping the 
country’s scientific enterprise to meet shared 21st-cen- 
tury challenges—such as preparing for future pandem- 
ics, fighting climate change, and sharing the benefits of 
science equally and equitably. 

The stakes of the war in Ukraine are high—the future 
of democracy in Europe is at risk. The global science com- 
munity should not only help guarantee that Ukrainian 
science remains a vital source of national advancement, 
but also ensure that it is part of international science so 
that its values of collaboration, cooperation, and mutual 
trust continue to contribute to a better world. 

—Jerzy Duszynski, Marcia McNutt, Anatoly Zagorodny 
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14.5% 


Portion of the world’s population likely infected by Lyme disease. 
The estimate comes from a meta-analysis in BMJ Global Health 

of 89 studies involving 158,287 people that examined the presence 
of blood antibodies. Prevalence has grown since 2010. 
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Dust grain hits Webb space telescope 


ne mirror segment of NASA’s James Webb Space Telescope was 

struck by a micrometeoroid last month, leading to a “margin- 

ally detectable effect” on its data collection, the agency said last 

week. Launched nearly 6 months ago, Webb has a 6.5-meter, 

gold-coated mirror made of 18 hexagonal segments. Mission 

planners expected many specks of high-speed space dust to 
ding Webb during its lifetime and simulated their impact using spare 
mirror segments. Operators recorded four small strikes during Webb’s 
deployment and commissioning phase. NASA estimated the speck that 
struck it in May was larger, but less than 0.1 millimeter wide. The tele- 
scope is designed to withstand many such hits, and engineers have 
already adjusted the shape and position of the affected mirror segment 
to mitigate distortion from the damage; NASA says the telescope’s per- 
formance still exceeds all mission requirements. 


U.S. nixes funding Russia work 


WAR | The U.S. government last week said 
it will not fund new collaborations with 
research institutions affiliated with Russia’s 
government and individuals working at 
those institutions, in response to Russia’s 
24 February invasion of Ukraine. The 11 June 
announcement comes months after several 
European nations went further, freezing 
existing collaborations. The U.S. policy 
allows existing joint projects to continue 
until completed. Some U.S. institutions 
have already voluntarily canceled existing 
collaborations with Russian universities; 
for example, in February the Massachusetts 
Institute of Technology pulled out of one 
with the Skolkovo Institute of Science and 
Technology, which MIT had helped found. 


Don’t rule out lab leak, panel says 


covib-19 | In yet another attempt to 
determine the origin of the pandemic, 

a new group of experts convened by the 
World Health Organization (WHO) last 
week issued a preliminary report that calls 
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for further study of several ideas, including 
the contentious one that SARS-CoV-2 might 
have escaped from a lab in Wuhan, China. 
An earlier WHO-organized commission 
drew controversy in 2021 by ranking the 
lab-leak notion “extremely unlikely” and not 
a priority for further study. Its successor, the 
Scientific Advisory Group for the Origins 

of Novel Pathogens (SAGO), says the idea 
can’t be ruled out, although “the strongest 
evidence” supports the hypothesis that the 
virus jumped from animals to humans, 
SAGO’s chair, South African virologist 
Marietjie Venter, said at a 9 June press 
conference. China has rejected the lab-leak 
theory, and panel members from China, 
Brazil, and Russia said they see no reason 
to study it further. SAGO also proposes a 
more careful examination of the possibility 
that SARS-CoV-2 jumped into humans at a 
Wuhan market that sold several mamma- 
lian species susceptible to the virus. 


Marine canyon to be preserved 


CONSERVATION | President Joe Biden’s 
administration last week proposed to make 


Hudson Canyon, an underwater chasm 

in the Atlantic Ocean, a national marine 
sanctuary. It would join a handful of other 
federally protected reserves off the eastern 
seaboard of the United States. The largest 
underwater canyon in this region, Hudson 
Canyon is roughly 160 kilometers southeast 
of New York City. It plunges to 4000 meters 
at its deepest point, and its drop below 

the surrounding seabed rivals the Grand 
Canyon’s depth. Hudson Canyon is home 
to more than 200 species of fish and other 
organisms, such as sperm whales, cold 
water corals, and sea turtles, some classified 
as vulnerable, and is dotted by shipwrecks. 
The designation would not automatically 
ban commercial fishing; federal officials 
plan to finalize allowed human activities 
after a 90-day comment period. 


Land-use maps pick up speed 


REMOTE SENSING | Researchers studying 
the ecological and environmental changes 
wrought by humanity now will have access 
to sharper and more frequently updated 
maps of Earth’s surface, categorized by land- 
use types such as trees, built-up areas, and 
bare land. The World Resources Institute 
(WRD and Google last week unveiled 
Dynamic World, a project that publishes 
global, digital land-use maps as frequently 
as every 2 days at resolutions as fine as 

10 meters. By comparison, many countries 


Amap of land types shows the Taal volcano in the 
Philippines before (left) and after it erupted in 2020. 
Green represents trees; gray, bare land. 
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Stars close to the Milky 
Way’s plane and center are 
rich in metals (red). 
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Star mapper provides Milky Way portrait 


he European Space Agency's Gaia satellite has now mapped 
almost 2 billion of the Milky Way’s stars, logging their posi- 
tions, speeds, temperatures, and other parameters—and 
allowing astronomers to chart the Galaxy's structure and 
evolution. Last week, operators released the third major 
trove of data, including lists of 800,000 binary stars, 10 million 
variable stars, and, within the Solar System, 156,000 asteroids 
and rocky bodies. Launched in 2013, Gaia has already uncovered 
new nearby satellite galaxies, fast-moving stars escaping the Milky 


have produced maps with no better than 
30-meter resolution and only update them 
annually, reflecting the time it takes to 
identify land types. Some regional maps are 
revised monthly, but even that pace is too 
slow to capture rapid change from wildfire 
or flooding. The new maps are based on 
imagery from the European Space Agency’s 
Sentinel-2 satellites; Google tailored artificial 
intelligence algorithms to estimate probable 
land-cover types, a method described last 
week in Nature Scientific Data. 


U.K. eyes scrutiny of foreign ties 


POLicy | U.K. universities may be forced 
to declare funding of more than £75,000 
from “foreign actors,” under proposed 
legislation. The law, which aims to protect 
free speech in higher education, would 
also give the government new powers to 
police the effects of university funding 
from countries such as China and Russia. 
It would exempt countries in the European 
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Union and NATO, as well as those belong- 
ing to the Academic Technology Approval 
Scheme, a U.K. program to weed out 
foreign students who may pose a national 
security threat. The move follows concerns 
about foreign influence on U.K. universi- 
ties, including Chinese efforts to block 
discussions of controversial topics, such 

as Taiwan and the treatment of Uighur 
Muslims. The legislation would also ban 
Confucius Institutes—Chinese centers of 
language and culture that have been closed 
in the United States. 


Storm watchers fail to reach orbit 


METEOROLOGY | A rocket launch carrying 
two small hurricane-monitoring satellites 
for NASA ended in disappointment on 

12 June. Rocketmaker Astra said the 
rocket’s second stage malfunctioned after 
liftoff; the CubeSats likely fell into the 
ocean and were destroyed. It was the fifth 
failure out of seven orbital launch attempts 


Way, and evidence of past galaxies that merged with ours. The 
new data show thousands of stars are convulsed by giant, violent 
waves. And, for the first time, the data also include spectra— 
breakdowns of starlight—that point to chemical compositions. 
Stars forge heavier elements and disperse them when they 
explode; the elements are taken up by subsequent generations of 
stars. Mapping that information shows the abundance of heavy 
elements is highest in the Galaxy’s central disk, where stars live 


by the fledgling company. The satellites 
were the first two of six planned CubeSats, 
each the size of a shoebox, in NASA’s 

$30 million TROPICS mission. Mission 
managers intend to use microwave radio- 
meters to see through clouds and measure 
the temperature and moisture of tropical 
storms. The fleet would improve forecasts 
of hurricane intensity by observing storms 
as often as every 50 minutes. The mission 
can still achieve many of its science goals 
with just four satellites, which Astra rock- 
ets are also supposed to launch. 


Rename monkeypox strains? 


INFECTIOUS DISEASES | Identifying 
monkeypox strains by their place of 
origin in Africa is “discriminatory and 
stigmatizing,” says an international group 
of researchers who have urged an over- 
haul of how the fast-spreading virus is 
named. Years ago, researchers divided 
monkeypox viruses into two “clades,” or 
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branches, named after the regions—West 
Africa and the Congo Basin—where many 
were first found. (The two clades differ in 
their genomes and the severity of disease 
caused.) But monkeypox needs a new, 
“practical and neutral system of nomen- 
clature,” the international group argues in 
a preprint posted last week on a virology 
website. They suggest renaming the Congo 
Basin and West African clades as 1 and 2, 
respectively, and creating a separate clade, 
3, for the current human outbreak strains, 
given their differences from those in the 
other two. The outbreak’s origin remains 
unknown. The call to rename the clades 
echoes debates over the names of other 
diseases and pathogens; one such recent 
discussion led to the current nomenclature 
for SARS-CoV-2 variants, with Greek let- 
ters replacing geographical names like the 
Wuhan or South African strain. 


China to grow journal portfolio 


SCIENTIFIC PUBLISHING | China plans to 
launch 50 new English-language academic 
journals in the next year in an ambi- 
tious expansion of its program to create 
world-class publications, Zhang Yuzhuo 
of the China Association for Science and 
Technology (CAST) said at a press confer- 
ence last week. Since 2019, the Chinese 
government has supported establishing 
up to 30 new journals each year. This 
year, CAST is accepting proposals in fields 


ranging from astronomy and artificial 
intelligence to public health and software 
engineering. Each approved new journal 
will get a one-time grant of $74,000 to help 
cover startup costs. The plan has also pro- 
vided grants to 250 existing publications 
to support training staff and upgrading 
websites. The journals’ improving quality 
has attracted more submissions and led to 
rising citation counts, says Xiaotian Chen, 
a librarian at Bradley University. Chen 
says Cell Research, Molecular Plant, and 
National Science Review are examples of 
Chinese journals that have gained interna- 
tional stature. 


U.S. ends travel testing rule 


covip-19 | People bound for the United 
States and its territories no longer need to 
test for the pandemic coronavirus ahead of 
traveling, President Joe Biden’s admin- 
istration announced last week. Since 
January 2021, the Centers for Disease 
Control and Prevention (CDC) had man- 
dated that those flying into the country, 
including U.S. citizens, present a nega- 
tive SARS-CoV-2 test result obtained no 
more than 24 hours prior to their arrival. 
(Those who had been infected with the 
virus in the 90 days prior to travel had to 
show proof they had recovered.) The testing 
requirement could now be lifted because 
“the COVID-19 pandemic has now shifted 
to a new phase,” given broad uptake of 


IN FOCUS Cactus bees (Diadasia rinconis) swarm in a mating ball in Texas ina 
photo that won the Grand Prize in the annual BigPicture competition, run by the 
California Academy of Sciences. Cactus bees are mostly solitary. 
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vaccines and high immunity levels, CDC’s 
announcement stated. An agency official 
said CDC plans to re-evaluate its decision 
in 90 days. It will monitor viral variants of 
concern and reinstate the testing mandate 
if needed. 


Troubled beagle facility to close 


ANIMAL WELFARE | A facility that has bred 
thousands of beagles for medical research 
and been criticized for more than 70 viola- 
tions of the U.S. Animal Welfare Act (AWA) 
will be shut down, its owner, contract 


research organization Inotiv, said this week. 


CEO Robert Leasure cited the growing cost 
of fixing the problems at the Cumberland, 
Virginia, establishment. The decision came 
2 months after Virginia enacted a law that 
prevents facilities from selling dogs or cats 
for research if they sustain a single, serious 
violation of the AWA after June 2023. In 

a first for a research breeding facility, the 
Department of Justice last month sued 
Inotiv subsidiary Envigo, which operates 
the facility, seeking to force AWA compli- 
ance and seized 446 puppies and dogs that 
veterinarians said were “in acute distress.” 
In a 13 June hearing, attorneys for the 
government and the company sparred over 
whether it should be allowed to sell about 
3000 dogs there to research clients or be 
compelled to give them away for adoption. 


NASA steps into UFO hunt 


FRONTIERS | NASA said last week it will 
spend up to $100,000 to gather data rele- 
vant to understanding “unidentified aerial 
phenomena,” more commonly known 

as UFOs. The civilian agency has typi- 
cally eschewed any studies of UFOs; only 
in recent years has it resumed funding 
research in the broader field of the search 
for extraterrestrial intelligence (SETI). The 
project parallels separate efforts by the 
U.S. Department of Defense to document 


and categorize UFO reports, after Congress 


directed it to create a permanent office for 
UFO investigations. NASA’s independent 
study team will be led by astrophysicist 
David Spergel, president of the Simons 
Foundation. Thomas Zurbuchen, NASA’s 
associate administrator for science, 
described the new effort as complemen- 
tary to its existing SETI research, which 
looks for “technosignatures” of advanced 
civilizations in astronomical observations. 
“NASA believes that the tools of scientific 
discovery are powerful and apply here 
also,” Zurbuchen said in a statement. 
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ATLAS (above) and the other particle detectors fed by Europe’s Large Hadron Collider should resume taking data next month. 


The triumph and tragedy of the Higgs boson 


Ten years ago, physicists found what they predicted. Little new has followed 


By Adrian Cho 


decade ago, particle physicists 

thrilled the world. On 4 July 2012, 

6000 researchers working with the 

world’s biggest atom smasher, the 

Large Hadron Collider (LHC) at the 

European particle physics labora- 
tory, CERN, announced they had discov- 
ered the Higgs boson, a massive, fleeting 
particle key to their abstruse explanation of 
how other fundamental particles get their 
mass. The discovery fulfilled a 45-year-old 
prediction, completed a theory called the 
standard model, and thrust physicists into 
the spotlight. 

Then came a long hangover. Before the 
27-kilometer-long ring-shaped LHC started 
to take data in 2010, physicists fretted that 
it might produce the Higgs and nothing 
else, leaving no clue to what lies beyond the 
standard model (Science, 23 March 2007, 
p. 1657). So far, that nightmare scenario is 
coming true. “It’s a bit disappointing,” al- 
lows Barry Barish, a physicist at the Califor- 
nia Institute of Technology. “I thought we 
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would discover supersymmetry,’ the leading 
extension of the standard model. 

It’s too early to despair, many physicists 
say. After 3 years of upgrades, the LHC is now 
powering up for the third of five planned 
runs, and some new particle could emerge 
in the billions of proton-proton collisions it 
will produce every second. In fact, the LHC 
should run for another 16 years, and with 
further upgrades should collect 16 times as 
much data as it already has. All those data 
could reveal subtle signs of novel particles 
and phenomena. 

Still, some researchers say the writing 
is on the wall for collider physics. “If they 
don’t find anything, this field is dead,’ says 
Juan Collar, a physicist at the University of 
Chicago who hunts dark matter in smaller 
experiments. John Ellis, a theorist at King’s 
College London, says hopes of a sudden 
breakthrough have given way to the prospect 
of a long, uncertain grind toward discovery. 
“It’s going to be like pulling teeth, not like 
teeth falling out.” 

Since the 1970s, physicists have been 
locked in a wrestling match with the standard 


model. It holds that ordinary matter consists 
of lightweight particles called up quarks and 
down quarks—which bond in trios to make 
protons and neutrons—along with electrons 
and featherweight particles called electron 
neutrinos. Two sets of heavier particles lurk 
in the vacuum and can be blasted into fleet- 
ing existence in particle collisions. All in- 
teract by exchanging other particles: The 
photon conveys the electromagnetic force, 
the gluon carries the strong force that binds 
quarks, and the massive W and Z bosons 
carry the weak force. 

The standard model describes everything 
scientists have seen at particle colliders so 
far. Yet it cannot be the ultimate theory of 
nature. It leaves out the force of gravity, and 
it doesn’t include mysterious, invisible dark 
matter, which appears to outweigh ordinary 
matter in the universe six to one. 

The LHC was supposed to break that im- 
passe. In its ring, protons circulating in op- 
posite directions crash together at energies 
nearly seven times as high as at any previ- 
ous collider, enabling the LHC to produce 
particles too massive to be made elsewhere. 
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A decade ago many physicists envisioned 
quickly spotting marvels including new 
force-carrying particles or even mini-black 
holes. “One would drown in supersymmetric 
particles,’ recalls Beate Heinemann, director 
of particle physics at the German laboratory 
DESY. Finding the Higgs would take longer, 
physicists predicted. 

Instead, the Higgs appeared in a relatively 
speedy 3 years—in part because it is some- 
what less massive than many physicists 
expected, about 133 times as heavy as a pro- 
ton, which made it easier to produce. And 
10 years after that monumental discovery, no 
other new particle has emerged. 

That dearth has undermined two of physi- 
cists’ cherished ideas. A notion called natu- 
ralness suggested the low mass of the Higgs 
more or less guaranteed the existence of new 
particles within the LHC’s grasp. According 
to quantum mechanics, any particles lurking 
“virtually” in the vacuum will interact with 
real ones and affect their properties. That’s 
exactly how virtual Higgs bosons give other 
particles their mass. 

That physics cuts both ways, however. 
The Higgs boson’s mass ought to be pulled 
dramatically upward by other 
standard model particles in 
the vacuum—especially the 
top quark, a heavier version 
of the up quark that weighs 
184 times as much as the pro- 
ton. That doesn’t happen, so 
theorists have reasoned that 
at least one other new particle 
with a similar mass and just 
the right properties—in particular, a different 
spin—must exist in the vacuum to “naturally” 
counter the effects of the top quark. 

The theoretical concept known as super- 
symmetry would supply such particles. 
For every known standard model particle, 
it posits a heavier partner with a different 
spin. Lurking in the vacuum, those partners 
would not only keep the Higgs’s mass from 
running away, but would also help explain 
how the Higgs field, which pervades the 
vacuum like an unextinguishable electric 
field, came into being. Supersymmetric par- 
ticles might even account for dark matter. 

But instead of those hoped-for particles, 
what have emerged in the past decade are 
tantalizing anomalies—small discrepan- 
cies between observations and standard 
model predictions—that physicists will 
explore in the LHC’s next 3-year run. For 
example, in 2017, physicists working with 
LHCb, one of four large particle detectors 
fed by the LHC, found that B mesons, par- 
ticles that contain a heavy bottom quark, 
decay more often to an electron and a posi- 
tron than to a particle called a muon and 
an antimuon. The standard model says the 
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“If they don’t 
find anything, 
this field is dead.” 


Juan Collar, 
University of Chicago 


two rates should be the same, and the dif- 
ference might be a hint of supersymmetric 
partners, Ellis says. 

Similarly, experiments elsewhere sug- 
gest the muon might be very slightly more 
magnetic than the standard model predicts 
(Science, 9 April 2021, p. 113). That anom- 
aly can be explained by the existence of 
exotic particles called leptoquarks, which 
might already be hiding undetected in the 
LHC’s output, Ellis says. 

The Higgs itself provides other avenues 
of exploration, as any difference between its 
observed and predicted properties would 
signal new physics. For example, in August 
2020, teams of physicists working with the 
LHC’s two biggest detectors, ATLAS and 
CMS, announced that both had spotted 
the Higgs decaying to a muon and an anti- 
muon. If the rate of that hard-to-see de- 
cay varies from predictions, the deviation 
could point to new particles hiding in the 
vacuum, says Marcela Carena, a theorist at 
Fermi National Accelerator Laboratory. 

Those searches likely won’t yield dra- 
matic “Eureka!” moments, however. 
“There’s a shift towards very precise mea- 
surements of subtle effects,” 
Heinemann says. Still, Carena 
says, “I very much doubt that 
in 20 years, I will say, ‘Oh, boy, 
after the Higgs discovery we 
learned nothing new.” 

Others are less sanguine about 
LHC experimenters’ chances. 
“They're facing the desert and 
they don’t know how wide it 
is,’ says Marvin Marshak, a physicist at the 
University of Minnesota, Twin Cities, who 
studies neutrinos using other facilities. Even 
optimists say that if the LHC finds noth- 
ing new, it will be harder to convince the 
governments of the world to build the next 
bigger, more expensive collider to keep the 
field going. 

For now, many physicists at the LHC are 
just excited to get back to smashing pro- 
tons. During the past 3 years, scientists 
have upgraded the detectors and reworked 
the lower energy accelerators that feed 
the collider. The LHC should now run at 
a more constant collision rate, effectively 
increasing the flow of data by as much as 
50%, says Mike Lamont, director of accel- 
erators and beams at CERN. 

Accelerator physicists have been slowly 
tuning up the LHC’s beams for months, 
Lamont says. Only when the beams are 
sufficiently stable will they turn on the 
detectors and resume taking data. Those 
switches should flip on 5 July, 10 years and 
1 day after the announcement of the Higgs 
discovery, Lamont says. “It’s good to be go- 
ing into some sustained running.” 


INFECTIOUS DISEASE 


Ancient DNA 
reveals Black 
Death source 


Graves in Kyrgyzstan hold 
early victims of plague that 
swept medieval Europe 


By Ann Gibbons 


he Syriac engraving on the medieval 

tombstone was tantalizing: “This is 

the tomb of the believer Sanmaq. [He] 

died of pestilence.” Sanmaq, who was 

buried in 1338 near Lake Issyk Kul 

in what is now northern Kyrgyzstan, 
was one of many victims of the unnamed 
plague. By scrutinizing field notes and more 
photos from the Russian team that had ex- 
cavated the graves in the 1880s, historian 
Philip Slavin found that at least 118 people 
from Sanmaq’s Central Asian trading com- 
munity died in the epidemic. 

Slavin was on the trail of the origin of the 
Black Death, which devastated Europe a 
decade after the Kyrgyzstan burials. But he 
knew the medieval diagnosis of “pestilence” 
encompassed many horrific diseases. “I was 
almost 100% certain it was the beginning of 
the Black Death,” says Slavin, a medieval his- 
torian at the University of Stirling. “But there 
was no way to prove it without DNA.” 

Now, Slavin is senior author of a new 
study of ancient DNA from the “pestilence” 
victims showing they were indeed infected 
with the bacterium, Yersinia pestis, that 
caused the Black Death. The strain that 
killed them was ancestral to all the strains 
that rampaged across Europe a decade later 
and continued to kill for the next 500 years. 
The bacterium jumped from rodents to hu- 
mans just before the Kyrgyzstan burials, 
perhaps after sudden changes in rainfall or 
temperature, the researchers propose this 
week in Nature. 

“This is the place where it all started— 
the Wuhan of the Black Death,” says se- 
nior author and paleogeneticist Johannes 
Krause of the Max Planck Institute for Evo- 
lutionary Anthropology. 

The finding confirms some other research- 
ers’ hunches about Central Asia as a source 
for the Black Death strain, and pinpoints a 
precise time and place. “There’s not much 
doubt about it—[the region is] where you 
have lots of reservoirs of the plague,’ says 
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A decade ago many physicists envisioned 
quickly spotting marvels including new 
force-carrying particles or even mini-black 
holes. “One would drown in supersymmetric 
particles,’ recalls Beate Heinemann, director 
of particle physics at the German laboratory 
DESY. Finding the Higgs would take longer, 
physicists predicted. 

Instead, the Higgs appeared in a relatively 
speedy 3 years—in part because it is some- 
what less massive than many physicists 
expected, about 133 times as heavy as a pro- 
ton, which made it easier to produce. And 
10 years after that monumental discovery, no 
other new particle has emerged. 

That dearth has undermined two of physi- 
cists’ cherished ideas. A notion called natu- 
ralness suggested the low mass of the Higgs 
more or less guaranteed the existence of new 
particles within the LHC’s grasp. According 
to quantum mechanics, any particles lurking 
“virtually” in the vacuum will interact with 
real ones and affect their properties. That’s 
exactly how virtual Higgs bosons give other 
particles their mass. 

That physics cuts both ways, however. 
The Higgs boson’s mass ought to be pulled 
dramatically upward by other 
standard model particles in 
the vacuum—especially the 
top quark, a heavier version 
of the up quark that weighs 
184 times as much as the pro- 
ton. That doesn’t happen, so 
theorists have reasoned that 
at least one other new particle 
with a similar mass and just 
the right properties—in particular, a different 
spin—must exist in the vacuum to “naturally” 
counter the effects of the top quark. 

The theoretical concept known as super- 
symmetry would supply such particles. 
For every known standard model particle, 
it posits a heavier partner with a different 
spin. Lurking in the vacuum, those partners 
would not only keep the Higgs’s mass from 
running away, but would also help explain 
how the Higgs field, which pervades the 
vacuum like an unextinguishable electric 
field, came into being. Supersymmetric par- 
ticles might even account for dark matter. 

But instead of those hoped-for particles, 
what have emerged in the past decade are 
tantalizing anomalies—small discrepan- 
cies between observations and standard 
model predictions—that physicists will 
explore in the LHC’s next 3-year run. For 
example, in 2017, physicists working with 
LHCb, one of four large particle detectors 
fed by the LHC, found that B mesons, par- 
ticles that contain a heavy bottom quark, 
decay more often to an electron and a posi- 
tron than to a particle called a muon and 
an antimuon. The standard model says the 
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two rates should be the same, and the dif- 
ference might be a hint of supersymmetric 
partners, Ellis says. 

Similarly, experiments elsewhere sug- 
gest the muon might be very slightly more 
magnetic than the standard model predicts 
(Science, 9 April 2021, p. 113). That anom- 
aly can be explained by the existence of 
exotic particles called leptoquarks, which 
might already be hiding undetected in the 
LHC’s output, Ellis says. 

The Higgs itself provides other avenues 
of exploration, as any difference between its 
observed and predicted properties would 
signal new physics. For example, in August 
2020, teams of physicists working with the 
LHC’s two biggest detectors, ATLAS and 
CMS, announced that both had spotted 
the Higgs decaying to a muon and an anti- 
muon. If the rate of that hard-to-see de- 
cay varies from predictions, the deviation 
could point to new particles hiding in the 
vacuum, says Marcela Carena, a theorist at 
Fermi National Accelerator Laboratory. 

Those searches likely won’t yield dra- 
matic “Eureka!” moments, however. 
“There’s a shift towards very precise mea- 
surements of subtle effects,” 
Heinemann says. Still, Carena 
says, “I very much doubt that 
in 20 years, I will say, ‘Oh, boy, 
after the Higgs discovery we 
learned nothing new.” 

Others are less sanguine about 
LHC experimenters’ chances. 
“They're facing the desert and 
they don’t know how wide it 
is,’ says Marvin Marshak, a physicist at the 
University of Minnesota, Twin Cities, who 
studies neutrinos using other facilities. Even 
optimists say that if the LHC finds noth- 
ing new, it will be harder to convince the 
governments of the world to build the next 
bigger, more expensive collider to keep the 
field going. 

For now, many physicists at the LHC are 
just excited to get back to smashing pro- 
tons. During the past 3 years, scientists 
have upgraded the detectors and reworked 
the lower energy accelerators that feed 
the collider. The LHC should now run at 
a more constant collision rate, effectively 
increasing the flow of data by as much as 
50%, says Mike Lamont, director of accel- 
erators and beams at CERN. 

Accelerator physicists have been slowly 
tuning up the LHC’s beams for months, 
Lamont says. Only when the beams are 
sufficiently stable will they turn on the 
detectors and resume taking data. Those 
switches should flip on 5 July, 10 years and 
1 day after the announcement of the Higgs 
discovery, Lamont says. “It’s good to be go- 
ing into some sustained running.” 
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he Syriac engraving on the medieval 

tombstone was tantalizing: “This is 

the tomb of the believer Sanmaq. [He] 

died of pestilence.” Sanmaq, who was 

buried in 1338 near Lake Issyk Kul 

in what is now northern Kyrgyzstan, 
was one of many victims of the unnamed 
plague. By scrutinizing field notes and more 
photos from the Russian team that had ex- 
cavated the graves in the 1880s, historian 
Philip Slavin found that at least 118 people 
from Sanmaq’s Central Asian trading com- 
munity died in the epidemic. 

Slavin was on the trail of the origin of the 
Black Death, which devastated Europe a 
decade after the Kyrgyzstan burials. But he 
knew the medieval diagnosis of “pestilence” 
encompassed many horrific diseases. “I was 
almost 100% certain it was the beginning of 
the Black Death,” says Slavin, a medieval his- 
torian at the University of Stirling. “But there 
was no way to prove it without DNA.” 

Now, Slavin is senior author of a new 
study of ancient DNA from the “pestilence” 
victims showing they were indeed infected 
with the bacterium, Yersinia pestis, that 
caused the Black Death. The strain that 
killed them was ancestral to all the strains 
that rampaged across Europe a decade later 
and continued to kill for the next 500 years. 
The bacterium jumped from rodents to hu- 
mans just before the Kyrgyzstan burials, 
perhaps after sudden changes in rainfall or 
temperature, the researchers propose this 
week in Nature. 

“This is the place where it all started— 
the Wuhan of the Black Death,” says se- 
nior author and paleogeneticist Johannes 
Krause of the Max Planck Institute for Evo- 
lutionary Anthropology. 

The finding confirms some other research- 
ers’ hunches about Central Asia as a source 
for the Black Death strain, and pinpoints a 
precise time and place. “There’s not much 
doubt about it—[the region is] where you 
have lots of reservoirs of the plague,’ says 
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physical anthropologist Barbara Bramanti of 
the University of Ferrara. 

In European historical accounts, the Black 
Death appears first in 1346 at ports on the 
Black Sea. Within a year it was in Europe, 
where scholars estimate it killed more than 
half of the population by 1353. In 1894, 
microbiologists identified ¥Y pestis as the 
cause. Ever since, they have debated where 
and when the deadly strain was born, consid- 
ering China, Central Asia, India, and Genghis 
Khan’s armies marching from Mongolia. 

In 2020, a new analysis of more than 1300 
modern and ancient genomes of Y. pestis 
narrowed the options. A team led by micro- 
biologist Mark Achtman of the University 
of Warwick used a new software tool to sort 
all known strains of ¥. pestis from humans 
and host animals into a family tree showing 
their evolution over 5500 years, starting with 
strains that were not closely related to the 
Black Death strain. 

One branch of the tree underwent a “big 
bang” explosion of diversity at the time of 
the Black Death, creating a starlike pattern 
of four new lineages of ¥. pestis whose de- 
scendant strains still persist in 40 species of 
rodents around the world. One of those lin- 
eages was the source of the Black Death and 
later outbreaks in Europe until the 18th cen- 
tury. The ancestral strain of this lineage was 
“literally the mother of them all,’ Krause says. 

Geneticists knew this mother strain did 
not arise in Europe, because the strains 


found in Black Death victims there dif- 
fered by two mutations from the putative 
ancestral genome, says paleogeneticist 
Maria Spyrou of the University of Tiibin- 
gen, who had been Krause’s postdoc. “We 
knew the European genomes were very 
close to origins of the Black Death, but 
not quite there,” she says. Several teams 
suspected the source was in Central Asia, 
where strains from rodents were the clos- 
est genetic match to the mother genome. 
But no one had DNA data on strains from 
human victims of the right time period. 

Then Krause and Spyrou heard Slavin 
give a talk about the tombstones. When he 
reported that the people had died of “pes- 
tilence,” they each immediately thought, 
“We should do DNA!” Krause recalls. 

Working with Slavin and Russian collab- 
orators including Valeri Khartanovich of 
the Peter the Great Museum of Anthropo- 
logy and Ethnography, where the Issyk Kul 
skulls were stored, Spyrou extracted DNA 
from the pulp of seven individuals’ teeth 
and found three were infected with Y. 
pestis. She was able to reconstruct a high- 
quality genome of the ancient strain that 
killed them. 

That strain “fell exactly on the origin 
point of that big bang event” in the evo- 
lution of ¥. pestis, Spyrou says. “That was 
incredibly exciting.” 

The strain was closely related to ones 
found in rodents near Issyk Kul today. 


Deadly spread 


Anew study pinpoints the first known cases of the plague that caused the Black Death, in people buried in 1338 
near Lake Issyk Kul in today's Kyrgyzstan. A decade later, bubonic plague had devastated Europe. 
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The authors suggest it spilled over to hu- 
mans, perhaps from a marmot, which are 
abundant in the Tian Shan mountain re- 
gion of northern Kyrgyzstan, southern 
Kazakhstan, and northwestern China. 
Sudden changes in rainfall or temperature 
could have led to surges in local rodent 
populations and the fleas or other insects 
they harbor. More rodents and their pests 
meant more opportunities to hop to a new 
host—humans—and adapt to it, says popu- 
lation biologist Nils Christian-Stenseth of 
the University of Oslo, who has shown a 
correlation between outbreaks of plague 
and warm, wet weather in Central Asia. He 
adds: “There are many good possibilities 
for plague reservoirs; you have the great 
gerbils, marmots, voles.” 

The remaining mystery, he says, is how 
the Black Death traveled 3500 kilometers 
from Central Asia to the Black Sea, where 
historical accounts describe the Mongolian 
army hurling the bodies of plague victims 
into the besieged city of Caffa in Crimea in 
1346 in an early form of biological warfare. 

The meticulous archaeological records for 
each Kyrgyzstan grave offer hints, Slavin says. 
Many people were buried with pearls, coins, 
and other goods from the Indian Ocean, the 
Mediterranean, and Iran; some were appar- 
ently traders. As they traveled, their camel 
wagons may have harbored rats and fleas, 
long considered likely vectors for plague. 

Another paper, in Nature Communica- 
tions last month, suggests rats were per- 
fectly positioned to help spread the Black 
Death. A team led by archaeologist David 
Orton of the University of York used the di- 
versity of ancient DNA in rat bones to trace 
the ups and downs of black rat populations 
through history. In Europe, one population 
collapsed with the fall of the Roman Em- 
pire but was replaced by another in the 
13th century, when growing cities offered 
new food and shelter for the rodents. Black 
rats and their fleas were everywhere at 
that time, Krause says, especially aboard 
ships traveling between the Black Sea and 
Mediterranean ports—the route the Black 
Death evidently followed. 

Meanwhile, the Issyk Kul graveyard is 
giving names and identities to the first 
known victims of the Black Death. “To ac- 
tually have Y. pestis from incredibly well 
dated burials is really exciting,’ says bio- 
archaeologist Sharon De Witte of the Uni- 
versity of South Carolina, Columbia. “We 
can clarify what other disease they were 
infected with and look at the biosocial fac- 
tors that might have shaped risk of death 
in that first wave.” 

As for Slavin, he’s still marveling at the 
discovery. “This was one of my dreams, to 
solve this outstanding puzzle.” 
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Studies tying weather extremes 
to global warming gain rigor 


Record-shattering events spur climate attribution advances 


By Paul Voosen 


n June 2021, a jet stream charged with 

heat and chaotic energy from a nearby 

cyclone stalled over the Pacific North- 

west. The mass of trapped air baked the 

already hot landscape below to a record 

49.6°C. More than 1000 people died from 
heat exposure. 

Scientists quickly began working to figure 
out how much of the blame for the heat wave 
could be laid to global warming. But the heat 
was so unusual, the weather so weird, that it 
broke their methods. “It challenged our tech- 
niques, our climate models, and our statisti- 
cal analysis methods,’ says Michael Wehner, 
a climate scientist at Lawrence Berkeley 
National Laboratory who participates in the 
World Weather Attribution (WWA) initiative. 
WWA ultimately issued a statement, find- 
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events so extreme that there is no historical 
record for comparison. 
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The 2021 Pacific Northwest heat wave was unique, 
making it tough to assess climate change’s role. 


push for wealthy countries to pay for their 
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instances, the model produced a heat wave 
resembling the real event, though not quite 
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Other techniques forgo models entirely. 
Davide Faranda, a climate scientist at the 
University of Paris-Saclay, compares recent 
records of sea level air pressure to records 
from 1950-79, when warming was only get- 
ting started. He says these pressure analogs 
reflect large-scale air flows that can drive ex- 
treme events such as the Pacific Northwest 
heat wave. The method ensures that com- 
parisons are like-to-like, unlike the model 
techniques, which might sometimes hinge 
on comparisons of regional rainfall, which 
can have diverse causes—a thunderstorm or 
a hurricane, for example. And it identifies 
when a weather event is truly new, resulting 
from chaotic pressure flows unseen in past 
records—which makes it statistically impos- 
sible to say whether climate change made 
the event more likely. In a preprint posted 
in February, Faranda and colleagues applied 
the method to maps of pressure leading up 
to a series of 2021 extremes, including flood- 
ing in Germany. They found the rainfall 
from similar extreme events has increased 
in modern times, a sign that warming made 
the flooding worse. 

Finally, one group of researchers, led 
by Nicholas Leach, a graduate student at 
the University of Oxford, is betting that 
detailed weather forecasting models can 
give a more precise picture of global warm- 
ing’s role in weather than coarser climate 
models. They simulated the 2021 Pacific 
Northwest heat wave with the world- 
leading model of the European Centre for 
Medium-Range Weather Forecasts. After 
lowering carbon dioxide levels and remov- 
ing human-driven heat from the ocean, 
the model reproduced a similar heat wave, 
although the highs were 2°C or so cooler 
than in the actual event, the team reported 
last month at the EGU meeting. The mod- 
el’s many runs for each scenario will allow 
them to assess whether climate change 
made the heat wave more likely. But WWA’s 
statement that the event was “virtually im- 
possible” without warming was likely an 
overreach, Leach says. “We don’t find that 
in our model.” 

Such advances are encouraging govern- 
ment agencies to move into attribution stud- 
ies. The National Oceanic and Atmospheric 
Administration, for example, has funded a 
pilot study on rapid attribution using its op- 
erational forecast model and climate models. 
The effort will seek to tease out the influence 
not only of global warming, but also of large- 
scale weather patterns like El Nino. Thiery 
can see a day coming where weather fore- 
casters might incorporate an attribution like 
Leach’s into their everyday work. “You can 
imagine a weather forecaster saying this heat 
wave is coming and it’s 2°C warmer because 
of climate change.” 
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GENOMICS 


Upstart DNA sequencers could 
be a ‘game changer’ 


Increasing potential for fast, cheap genomes may break 
open biology’s bottleneck and broaden clinical uses 


By Elizabeth Pennisi 


or DNA sequencing, this “is the year 

of the big shake-up,” says Michael 

Snyder, a systems biologist at Stan- 

ford University. Sequencing is cru- 

cial to fields from basic biology to 

virology to human evolution, and its 
importance keeps growing. Clinicians are 
clamoring to harness it for early detection 
of cancer and other diseases, and biologists 
are finding ever more ways to use genom- 
ics to study single cells. But for years, most 
sequencing has relied on machines from a 
single company, Illumina. 

Last week, however, a young company 
called Ultima Genomics said at a meeting 
in Orlando, Florida, that with new twists 
on existing technologies, it could provide 
human genomes for $100 a pop, one-fifth 
the going rate. Several other companies 
also promised faster, cheaper sequencing 
at the same meeting, Advances in Genome 
Biology and Technology. This year, key 
patents protecting Illumina’s sequencing 
technology will expire, paving the way for 
more competition, including from a Chi- 
nese company, MGI, which last week an- 
nounced it would begin to sell its machines 
in the United States this summer. “We may 
be on the brink of the next revolution in 
sequencing,” says Beth Shapiro, an evolu- 
tionary biologist at the University of Cali- 
fornia, Santa Cruz (UCSC). 

Most sequencing companies, including 
Illumina, which has controlled 80% of the 
global market, depend on “sequencing by 
synthesis.” The DNA to be deciphered is 
separated into single strands, which are 
usually chopped into short pieces and 
mounted on a surface—often a tiny bead— 
in a container called a flow cell. Each sin- 
gle strand fragment serves as a template to 
guide the synthesis of a strand with com- 
plementary bases, supplied one at a time 
to channels of beads. Because each added 
base has been modified to glow, a camera 
can record where it attaches—and hence 
the identity of the corresponding base on 
the original strand. The steps are repeated 
until the new DNA strand is complete. 

Ultima streamlined the process by 


spraying the DNA-laden beads by the bil- 
lions onto round silicon wafers the size 
of dessert plates. Nozzles above each wa- 
fer gently squirt out bases and other re- 
agents, which spread thinly and evenly 
across the wafer as it rotates, reducing 
the amount of these expensive materi- 
als needed. Instead of moving back and 
forth across under the camera, the disk 
moves in a spiral, akin to how a compact 
disk is played, which speeds up imag- 
ing. It’s “clever engineering [that] avoids 
a lot of complex plumbing,” says Mark 
Akeson, a molecular biologist at UCSC. A 
neural network program rapidly turns im- 
aging data into a sequence. 

The sequencing chemistry is different as 
well. Only a few bases carry fluorescent tags, 


The more than 3 billion letters in the human 
genome can now be sequenced for $100, several 
companies claim. 


reducing costs. Moreover, the bases lack the 
usual stop signal, which ensures no extra 
bases latch on. Without these “termina- 
tors,” the growing chain can sometimes add 
multiple bases at once, speeding the pro- 
cess. “Many of these innovations are used 
elsewhere, but they seem to have come to- 
gether very nicely here,” says Jay Shendure, 


17 JUNE 2022 * VOL 376 ISSUE 6599 1257 


NEWS | IN DEPTH 


PHOTO: JUSUN/ISTOCK.COM 


Other techniques forgo models entirely. 
Davide Faranda, a climate scientist at the 
University of Paris-Saclay, compares recent 
records of sea level air pressure to records 
from 1950-79, when warming was only get- 
ting started. He says these pressure analogs 
reflect large-scale air flows that can drive ex- 
treme events such as the Pacific Northwest 
heat wave. The method ensures that com- 
parisons are like-to-like, unlike the model 
techniques, which might sometimes hinge 
on comparisons of regional rainfall, which 
can have diverse causes—a thunderstorm or 
a hurricane, for example. And it identifies 
when a weather event is truly new, resulting 
from chaotic pressure flows unseen in past 
records—which makes it statistically impos- 
sible to say whether climate change made 
the event more likely. In a preprint posted 
in February, Faranda and colleagues applied 
the method to maps of pressure leading up 
to a series of 2021 extremes, including flood- 
ing in Germany. They found the rainfall 
from similar extreme events has increased 
in modern times, a sign that warming made 
the flooding worse. 

Finally, one group of researchers, led 
by Nicholas Leach, a graduate student at 
the University of Oxford, is betting that 
detailed weather forecasting models can 
give a more precise picture of global warm- 
ing’s role in weather than coarser climate 
models. They simulated the 2021 Pacific 
Northwest heat wave with the world- 
leading model of the European Centre for 
Medium-Range Weather Forecasts. After 
lowering carbon dioxide levels and remov- 
ing human-driven heat from the ocean, 
the model reproduced a similar heat wave, 
although the highs were 2°C or so cooler 
than in the actual event, the team reported 
last month at the EGU meeting. The mod- 
el’s many runs for each scenario will allow 
them to assess whether climate change 
made the heat wave more likely. But WWA’s 
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a geneticist and technology developer at the 
University of Washington (UW), Seattle. 

Ultima CEO Gilad Almogy and his col- 
leagues demonstrated the technology’s po- 
tential in four preprints posted in late May 
on bioRxiv. In one, they and colleagues at 
the Broad Institute of MIT and Harvard 
used their machine to sequence more than 
224 already-sequenced human genomes 
and found their results on par with previ- 
ous work. The three other studies showed 
the technology can evaluate a single cell’s 
repertoire of expressed genes, the effects of 
mutations, and epigenetics—chemical mod- 
ifications of DNA that affect gene activity. 

Until now, cost has limited such single- 
cell studies, causing a bottleneck in research. 
But Snyder found Ultima’s low-cost approach 
enabled him to sequence multiple colon can- 
cer cells to document how one DNA modi- 
fication, methylation, changes 
as colon cancer develops. 

In another preprint, 
Joshua Levin and his Broad 
Institute colleagues tested 
the ability of the Ultima 
technology to identify active 
genes in single blood cells as 


“We may be on 
the brink of 
the next revolution 

in sequencing.” 


Pachter and others also take issue with 
the touted $100 cost. That figure only cov- 
ers reagents, not the labor, pre- and post- 
sequencing steps, and initial outlay for the 
machine, the price of which has not been 
released. Even if the $100 figure is real, it 
may not be unique: Other companies are 
also promising $100 per human genome. 

One is MGI, a subsidiary of Chinese se- 
quencing giant BGI. MGI’s technology is 
similar to Ilumina’s, but it increases accu- 
racy by adding all four bases at once as it 
sequences DNA. To track which bases are 
incorporated, it uses antibodies, which are 
brighter and less expensive than fluorescent 
dyes. Illumina, too, is promising lower costs, 
and at the meeting it introduced new chem- 
istries to increase accuracy and flexibility. 

For this bargain rate to be realized, 
Ultima and MGI both require filling their 
sequencers to capacity with 
hundreds of genomes. But 
high-throughput sequencing 
“Gs not always good for clini- 
cal practice even if it is good 
economics,” says Greg Elgar, a 
genome biologist at Genomics 
England, because sometimes 


indicated by the genes’ RNA Beth Shapiro, a physician needs just one or 
transcripts. The team found University of California, a few people’s genomes ana- 
Ultima’s machine identified Santa Cruz lyzed. Other companies with 


those genes about as well as 
Tllumina’s did. And, he adds, “It’s a game 
changer due to the lower cost.” 

Florence Chardon, a UW genomics grad- 
uate student who modifies DNA with the 
genome editor CRISPR, is excited by that 
prospect. “The less expensive [sequencing] 
gets, the more accessible this kind of re- 
search is to more labs and more people,” 
she says. 

But Lior Pachter, a computational bio- 
logist at the California Institute of Tech- 
nology, has reservations about the new 
technology. He and graduate student A. 
Sina Booeshaghi looked at one of the most 
active genes in blood cells from Levin's 
team, a possible cancer biomarker also 
known for producing a protein athletes 
sometimes inject to illegally enhance 
their performance. The Ultima technology 
sometimes missed the active gene, Pachter 
says. The “error rate was very high, and the 
performance was very poor.” 

The gene has a stretch in which the same 
base is repeated eight times, and Ultima ad- 
mits long repeats can undermine the accu- 
racy of its reads. Looking elsewhere in the 
Ultima sequence, Pachter found errors when 
one base was repeated just three times. He 
notes that a human genome contains at least 
14 million of these so-called homopolymers. 
Still, he says, “For some applications, you 
don’t need perfect sequences.” 
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new flow cells and chemis- 
tries can economically sequence small num- 
bers of genomes. At last week’s meeting, 
Element Biosciences CEO Molly He reported 
the company is now shipping benchtop se- 
quencers that can sequence three human 
genomes at a time, at a cost of $560 each. 
Another company, Singular Genomics, also 
promises benchtop technology that doesn’t 
require high throughput for cost savings. 

These machines, like those from Illumina, 
MGI, and Ultima, all decipher short frag- 
ments of DNA. But for the past 7 years, two 
companies, Pacific Biosciences and Oxford 
Nanopore Technologies, have worked on se- 
quencing “long reads,” thousands of bases 
long, which leave fewer partial sequences to 
piece together into a full genome. The tech- 
nologies “can sequence the native DNA mol- 
ecule, in all its glory,’ Elgar says. They have 
struggled with low accuracy and high cost, 
but he says they are on their way to becom- 
ing practical tools. 

Don’t count the sequencing giant II- 
lumina out just yet. Its scientists “prob- 
ably have kept a couple of cards in their 
back pocket” to keep their position in the 
market strong, says Albert Vilella, a bio- 
informatician and genomics consultant in 
Cambridge, England. Nonetheless, Illumina 
faces unprecedented competition, he adds. 
“It’s time to look at the [DNA sequencing] 
landscape with fresh eyes.” 


INFECTIOUS DISEASE 


Monkeypox 
could establish 
new reservoirs 
in animals 


Concern grows that virus 
will get footholds in species 
outside Africa 


By Jon Cohen 


leven days after being bitten by one 
of her pet prairie dogs, a 3-year-old 
girl in Wisconsin on 24 May 2003 
became the first person outside of 
Africa to be diagnosed with monkey- 
pox. Two months later, her parents 
and 69 other people in the United States 
had suspected or confirmed cases of this 
disease, which is caused by a relative of 
the much deadlier smallpox virus. The 
monkeypox virus is endemic in parts of 
Africa, and rodents imported from Ghana 
had apparently infected captive prairie 
dogs from North America when an animal 
distributor in Texas housed them together. 

The outbreak now underway has af- 
fected more people outside of Africa than 
ever before—more than 1800 cases as of 
14 June, on multiple continents, many of 
them men who have sex with men. But like 
the 2003 episode, today’s surge has raised 
a possibility that makes researchers gulp: 
Monkeypox virus could take up perma- 
nent residence in wildlife outside of Africa, 
forming a reservoir that could lead to re- 
peated human outbreaks. 

No animal reservoir currently exists out- 
side of Africa, but the U.S. outbreak of 2003 
was a close call, some scientists suspect, es- 
pecially because nearly 300 of the animals 
from Ghana and the exposed prairie dogs 
were never found. “We narrowly escaped 
having monkeypox establish itself in a wild 
animal population” in North America, sug- 
gests Anne Rimoin, an epidemiologist at 
the University of California, Los Angeles, 
who long has studied the disease in the 
Democratic Republic of the Congo (DRC). 

In the end, however, surveys of wild ani- 
mals in Wisconsin and Illinois never found 
monkeypox virus, none of the infected hu- 
mans passed on the disease to other peo- 
ple, and worries about this exotic outbreak 
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outside Africa 


By Jon Cohen 


leven days after being bitten by one 
of her pet prairie dogs, a 3-year-old 
girl in Wisconsin on 24 May 2003 
became the first person outside of 
Africa to be diagnosed with monkey- 
pox. Two months later, her parents 
and 69 other people in the United States 
had suspected or confirmed cases of this 
disease, which is caused by a relative of 
the much deadlier smallpox virus. The 
monkeypox virus is endemic in parts of 
Africa, and rodents imported from Ghana 
had apparently infected captive prairie 
dogs from North America when an animal 
distributor in Texas housed them together. 

The outbreak now underway has af- 
fected more people outside of Africa than 
ever before—more than 1800 cases as of 
14 June, on multiple continents, many of 
them men who have sex with men. But like 
the 2003 episode, today’s surge has raised 
a possibility that makes researchers gulp: 
Monkeypox virus could take up perma- 
nent residence in wildlife outside of Africa, 
forming a reservoir that could lead to re- 
peated human outbreaks. 

No animal reservoir currently exists out- 
side of Africa, but the U.S. outbreak of 2003 
was a close call, some scientists suspect, es- 
pecially because nearly 300 of the animals 
from Ghana and the exposed prairie dogs 
were never found. “We narrowly escaped 
having monkeypox establish itself in a wild 
animal population” in North America, sug- 
gests Anne Rimoin, an epidemiologist at 
the University of California, Los Angeles, 
who long has studied the disease in the 
Democratic Republic of the Congo (DRC). 

In the end, however, surveys of wild ani- 
mals in Wisconsin and Illinois never found 
monkeypox virus, none of the infected hu- 
mans passed on the disease to other peo- 
ple, and worries about this exotic outbreak 
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evaporated. Will North and South America, 
Europe, Asia, and Australia—all of which 
have reported monkeypox cases in this 
outbreak—be similarly fortunate this time? 

Viruses frequently pingpong between 
humans and other species. Although 
COVID-19 is widely thought to have re- 
sulted from SARS-CoV-2 jumping from 
a bat or other host into people, humans 
have, in “reverse zoonoses,” also infected 
white-tailed deer, minks, cats, and dogs 
with the virus. One study in Ohio found 
antibodies to SARS-CoV-2 in more than 
one-third of 360 wild deer sampled. And 
in past centuries, when humans carried 
plague and yellow fever to new continents, 
those pathogens created reservoirs in ro- 
dents and monkeys, respectively—which 
later infected humans again. 

As this outbreak of monkeypox expands, 
the virus has an unprecedented 
opportunity to establish itself 
in non-African species, which 
could infect humans and pro- 
vide greater opportunity for 
more dangerous variants to 
evolve. “Monkeypox reservoirs 
in wild animals outside of Africa 
is a scary scenario,” says Bertram 
Jacobs, a virologist at Arizona 
State University (ASU), Tempe, 
who studies vaccinia, the pox- 
virus that served as the smallpox 
vaccine and helped eradicate that 
devastating virus from humans. 

Public health officials in sev- 
eral countries have advised peo- 
ple who have monkeypox lesions 
to avoid contact with their pets. 
Some 80% of the cases have oc- 
curred in Europe, and the Euro- 
pean Food Safety Authority said 
no pets or wild animals had been infected as 
of 24 May. But it added that “close collabora- 
tion between human and veterinary public 
health authorities is needed to manage ex- 
posed pets and prevent the disease from be- 
ing transmitted to wildlife.” 

The possibility that humans infected 
with monkeypox virus will spread it to 
wildlife outside of Africa “warrants seri- 
ous concern,” says William Karesh, a vet- 
erinarian at the EcoHealth Alliance who 
on 2 June spoke about this possibility at 
a consultation on monkeypox research or- 
ganized by the World Health Organization. 
For now, he says, the limited number of 
human cases reduces the odds. But pet ro- 
dents are a particular worry, as is the sheer 
number of wild ones—they make up 40% 
of all mammals—that frequently raid trash 
and could become infected by contami- 
nated waste. “That’s a lot of opportunity,” 
he says. 
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Studies have yet to pinpoint the Afri- 
can reservoir of the monkeypox virus. Al- 
though a lab in Copenhagen, Denmark, in 
1958 first identified it in research monkeys 
from Asia, scientists now believe the pri- 
mates caught it from an African source. 
Since the first human case was reported in 
1970, in the DRC (then Zaire), all could be 
tied to the virus spilling over from animals 
in Africa. 

So far, however, only six wild animals 
trapped in Africa have yielded the vi- 
rus: three rope squirrels, a Gambian rat, 
a shrew, and a sooty mangabey monkey. 
Antibodies to the monkeypox virus are 
most abundant in African squirrels. “We 
still poorly understand the current reser- 
voir other than it’s rodents,’ says Grant 
McFadden, a poxvirus researcher who is 
also based at ASU. 
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The Kautzer family, shown here in June 2003 with their prairie dog Chuckles, were 
all infected by monkeypox 1 month earlier by a second pet prairie dog that died. 


But it’s clear monkeypox can infect many 
other kinds of animals in the wild and cap- 
tivity. A 1964 outbreak in a Rotterdam, 
Netherlands, zoo sickened giant anteaters, 
orangutans, gorillas, chimpanzees, a gibbon, 
and a marmoset. Researchers have intention- 
ally infected many lab animals, including 
rabbits, hamsters, guinea pigs, and chickens, 
although the virus doesn’t reliably cause dis- 
ease in several of them. 

For many viruses, a lock-and-key relation- 
ship between viral surface proteins and recep- 
tors on host cells determines which animals 
it can infect; the spike protein of SARS-CoV-2, 
for example, latches onto angiotensin- 
converting enzyme 2, a protein that studs a 
variety of cells in humans, minks, cats, and 
many other species. But poxviruses don’t 
seem to require specific host receptors, en- 
abling many to infect a wide range of mam- 
malian cells. Vaccinia, the smallpox vaccine 
virus, can even infect fruit flies in addition 


to cows and people, notes David Evans, a 
poxvirus researcher at the University of Al- 
berta, Edmonton. Bernard Moss, a virologist 
at the U.S. National Institute of Allergy and 
Infectious Diseases (NIAID), has posited that 
some poxviruses have proteins on their sur- 
faces that form a “hydrophobic face,” a water- 
repelling area that can bind nonspecifically 
to hydrophilic cell membranes and initiate 
the infection process. 

But whether a poxvirus can thrive in an 
infected cell and, ultimately persist in a spe- 
cies to create a reservoir depends on how well 
it fends off the host’s immune attacks. Pox- 
viruses have a relatively large complement of 
genes, about 200, and roughly half undermine 
a host’s immune response. “Some viruses run 
and hide or are stealthy, avoiding direct con- 
tact with elements of the immune system,” 
McFadden says. “Poxviruses by and large 
stand up and fight.” 

Their defense against host im- 
munity appears to rely heavily 
on a family of genes scattered 
around their genomes that code 
for poorly understood proteins 
containing domains known as 
ankyrin repeats. Poxvirus pro- 
teins containing these repeats 
act as “molecular  flypaper,’ 
Evans says, glomming onto host 
proteins involved with coordi- 
nating the immune response. 
“Orthopoxviruses have these 
arrays of ankyrin repeats, and 
most of them, we don’t really 
know what they target,’ Evans 
says. “But the bottom line is those 
probably hold the key to trying 
to understand why it is some of 
these viruses have the host range 
that they do.” 

Variola, the smallpox virus, appears 
to have lost many of these immune- 
evasion genes. It only persists in humans 
and has no animal reservoir, which was 
why the global vaccination campaign 
could eradicate it. Monkeypox is clearly 
more promiscuous. But there’s no telling 
whether it is capable of creating reservoirs 
in non-African wildlife. “One of the chal- 
lenges has been a lack of interest,’ says 
Lisa Hensley, a microbiologist at the U.S. 
Department of Agriculture who began do- 
ing monkeypox research in 2001 as part of 
a U.S. Army lab. 

Hensley, who worked on monkeypox at 
NIAID for nearly a decade and collaborated 
with Rimoin, urges people to keep an open 
mind about how the virus behaves and what 
it might do next. “We’re recognizing that 
this is a disease we need to worry about 
and that we really don’t know as much as 
we think we know.” & 
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ASTRONOMY 


Telescope in India snares light in spinning mercury 


Inexpensive “liquid mirrors” could be used in giant telescopes on the Moon 


By Daniel Clery 


unique telescope that focuses light 
with a slowly spinning bow] of liquid 
mercury instead of a solid mirror has 
opened its eye to the skies above In- 
dia. Such telescopes have been built 
before, but the 4-meter-wide Interna- 
tional Liquid Mirror Telescope (ILMT) is 
the first large one to be purpose-built for 
astronomy, at the kind of high-altitude site 
observers prize—the 2450-meter Devasthal 
Observatory in the Himalayas. 

Although its users must sat- 
isfy themselves with only look- 
ing straight up, the $2 million 
instrument, built by a consor- 
tium from Belgium, Canada, 
and India, is much cheaper than 
telescopes with glass mirrors. A 
stone’s throw from ILMT is the 
3.6-meter, steerable Devasthal 
Optical Telescope (DOT), built 
by the same Belgian company at 
the same time—but for $18 mil- 
lion. “Simple things are often 
the best,” says Project Director 
Jean Surdej of the University 
of Liége. Some astronomers say 
liquid mirrors are the perfect 
technology for a giant telescope 
on the Moon that could see back 
to the time of the universe’s 
very first stars. 

When a bow! of reflective liquid mercury 
is rotated, the combination of gravity and 
centrifugal force pushes the liquid into a 
perfect parabolic shape, exactly like a con- 
ventional telescope mirror—but without 
the expense of casting a glass mirror blank, 
grinding its surface into a parabola, and 
coating it with reflective aluminum. 

ILMT was originally dreamt up in the late 
1990s. The dish-shaped vessel that holds 
the mercury was delivered to India in 2012, 
but construction of the telescope enclosure 
was delayed. Then researchers found they 
didn’t have enough mercury. As they waited 
for more, the COVID-19 pandemic struck, 
making travel to India impossible. Finally, 
in April, the team set 50 liters of mer- 
cury spinning, creating a parabolic layer 
3.5 millimeters thick. After such a long 
gestation, “we're all very happy,’ says team 
member Paul Hickson of the University of 
British Columbia, Vancouver. 
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Staring straight up, the rotating mirror will 
see a swath of sky almost as wide as the full 
Moon while Earth’s rotation scans it across 
the heavens from dusk to dawn. “You just 
turn it on and let it go,’ Hickson says. Objects 
appear as long streaks in the image; the sepa- 
rate pixels can be added together afterward 
to create a single long exposure. Because the 
telescope sees roughly the same strip of sky 
on successive nights, exposures from many 
nights can be added together to get extremely 
sensitive images of faint objects. 


The International Liquid Mirror Telescope relies on a spinning dish containing 
50 liters of mercury. 


Alternatively, one night’s image can be 
subtracted from the next’s to see what has 
changed, revealing transient objects such 
as supernovae and quasars, the bright 
hearts of distant galaxies that wax and 
wane as supermassive black holes con- 
sume matter. Surdej also wants to hunt for 
gravitational lenses, in which the gravity of 
a galaxy or galaxy cluster bends the light of 
a more distant object like a giant magnify- 
ing glass. ILMT’s sensitive measurement of 
the objects’ brightness can reveal the mass 
distribution of the lens galaxies. A study 
suggested as many as 50 lenses might be 
visible in ILMT’s view of the sky. 

Conventional survey telescopes, such as 
the Zwicky Transient Facility in California 
and the upcoming Vera C. Rubin Obser- 
vatory in Chile, cover much more of the 
sky. But they are unlikely to return to the 
same patch every single night to search for 
changes. “We're forced to have a niche,” 


Hickson says. ILMT has the added strength 
of sitting next to DOT, which is equipped 
with instruments that can rapidly scru- 
tinize any fleeting objects discovered by 
its next-door neighbor. This tag-team 
approach “is more comprehensive, and 
scientifically more rich, says Dipankar 
Banerjee, director of the Aryabhatta Re- 
search Institute of Observational Sciences, 
which runs the Devasthal Observatory. 

If ILMT is a success, Surdej says the 
technology could be scaled up to build 
much larger liquid mirrors on 
the Moon, an attractive location 
for giant telescopes because it 
is less seismically active than 
Earth and has no atmosphere. 
On Earth, the Coriolis effect, 
from the _ planet’s_ rotation, 
would warp the motion of the 
mercury in mirrors larger than 
8 meters. But the Moon rotates 
more slowly, allowing much 
larger liquid mirrors—although 
not of mercury. It is too heavy 
to transport to the Moon and 
would freeze at night and evap- 
orate during the day. But more 
than a decade ago, liquid mirror 
pioneer Ermanno Borra of Laval 
University showed that “ionic 
liquids,” lightweight molten 
salts with low freezing points, 
would survive lunar conditions 
and could be made reflective with a thin 
coating of silver. 

In the 2000s, both NASA and the Cana- 
dian Space Agency commissioned studies 
of lunar liquid mirror telescopes but didn’t 
go any further. Astronomers hope the cur- 
rent interest in Moon exploration and the 
cheap launches offered by rocket compa- 
nies such as SpaceX will spur a revival. In 
2020, a team at the University of Texas, 
Austin, proposed the Ultimately Large 
Telescope, a 100-meter liquid mirror that 
would stare constantly at the same patch 
of sky for years on end from one of the 
Moon’s poles. Such a giant could gather the 
faint trickle of photons from the very first 
stars that lit up the universe, before gal- 
axies even existed. Veteran mirrormaker 
Roger Angel of the University of Arizona 
says there is “a unique niche for a big [liq- 
uid] mirror that goes beyond what others 
can do.” & 
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GLUES TO LONG COVID 


Scientists strive to unravel what is driving disabling symptoms 


hey span three continents, but a 
trio of researchers who’ve never 
met share a singular focus made vi- 
tal by the still-raging pandemic: de- 
ciphering the causes of Long Covid 
and figuring out how to treat it. 
Almost 2 years ago in Italy, pe- 
diatric infectious disease doctor 
Danilo Buonsenso, who works at 
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By Jennifer Couzin-Frankel; 
Illustrations by Ashley Mastin 


Gemelli University Hospital, started to see 
children who, months after mild infec- 
tions with SARS-CoV-2, were still short of 
breath and had crushing fatigue and other 
symptoms. He now suspects that, in some 
of them, the cells and tissues that control 


blood flow are damaged and the blood’s 
tendency to clot is amplified. Minute blood 
clots, leftover from the viral assault or fu- 
eled by its aftermath, might be gumming up 
the body’s circulation, to disastrous effect 
from the brain to joints. “In some patients 
we have specific areas where no blood flow 
comes in” or the flow is reduced, Buonsenso 
says. Is that driving their lingering symp- 
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toms? “I can’t say this is the truth, of course. 
But this makes sense.” 

Meanwhile, in the United States, micro- 
biologist Amy Proal can’t stop thinking 
about a second leading Long Covid theory: 
that the coronavirus keeps hurting people by 
stubbornly enduring in the body, even after 
acute infection passes. Studies have shown 
“the virus is capable of persistence in a wide 
range of body sites,’ especially nerves and 
other tissues, says Proal, who works at the 
PolyBio Research Foundation, a nonprofit 
in Washington state. She recently caught 
COVID-19 for the third time. 

Down under in Australia, immunologist 
Chansavath Phetsouphanh of the Univer- 
sity of New South Wales, Sydney, is chasing 
a third lead, motivated by what the blood 
of Long Covid patients has divulged: an im- 
mune system gone haywire even 8 months 
after they'd first tested positive. He had 
assumed that immune cells galvanized 
to fight off infection would have calmed 
down over that time span. So, “It was a sur- 
prise that these cells did not recover,’ says 
Phetsouphanh, who is working to set up an 
international Long Covid collaboration. 

For each of these researchers—and many 
others exploring the causes of Long Covid— 
untangling the complex syndrome, with 
a still-evolving definition, is a laborious, 
step-wise process. First, they must show 
that a possible contributor—such as mi- 
nuscule clots, lingering virus, or immune 
abnormalities—crops up disproportionately 


Glacial pace for U.S. Long Covid grants 


By Meredith Wadman 


avid Putrino, a neurophysiologist at the Icahn School of Medicine 


at Mount Sinai, labored through his holiday 


a grant application for urgently needed Long Covid research. With 
colleagues, he hoped to tap into $1.15 billion in funding that Con- 


gress granted the National Institutes of Hea 
Long Covid emerged as a major public health pro 
grant applications in December 2021, just weeks 


due date. The agency said it planned to issue deci 


But as of 10 June, Putrino was still waiting to h 
fund his effort to discover whether microclots mi 


in people with Long Covid. Then comes the 
hard part: proving that each of these traits, 
alone or in combination, explains why the 
coronavirus has rendered millions of people 
shadows of their former selves. 

All agree that solo operators are unlikely. 
Lingering virus, for example, could attack 
the circulatory system, triggering blood 
clots or chronic inflammation. “I see this as 
a triangle,” Buonsenso says, with each trig- 
ger potentially explaining, or even amplify- 
ing, the others. 

A final challenge is identifying treatments 
that ease or reverse these abnormalities 


“I consider Long Covid to be 
a massive emergency.” 


Amy Proal, PolyBio Research Foundation 


and help patients feel better. In the United 
Kingdom, home to a widely praised effort to 
identify immediate COVID-19 treatments, 
researchers are launching a clinical trial 
that will be among the largest worldwide 
to test potential Long Covid therapies in 
a randomized, statistically robust man- 
ner. But more such studies are needed— 
and time is of the essence (see sidebar, 
below). In May, the U.S. Centers for Dis- 
ease Control and Prevention reported that 
a review of the medical records of nearly 
2 million people suggested at least one in 
five of those diagnosed with COVID-19 had 


developed conditions characteristic of Long 
Covid. Other studies have found roughly 
similar rates. Some recent research suggests 
the risk for vaccinated people is somewhat 
lower, but vaccination’s power to head off 
the syndrome remains uncertain. 

For Proal and others, fitting the puzzle 
pieces together is of urgent concern. “I 
consider Long Covid to be a massive emer- 
gency,” she says. 


TINY BLOOD CLOTS 

In Rome, Buonsenso is using a sophisti- 
cated medical imaging technique to better 
understand the role of blood clots. Called a 
SPECT-CT scan, it combines two different 
types of images: a single-photon emission 
computed tomography scan, which uses a 
radiotracer injected into a person’s veins to 
provide pictures of blood flow, and a stan- 
dard CT scan for information about lung 
structure. By merging these, doctors can see 
which parts of the lungs aren’t getting nor- 
mal blood flow. 

So far, Buonsenso has scanned the lungs 
of 11 youngsters who have severe Long 
Covid, including irregular heartbeats and 
breathlessness during exercise. In six chil- 
dren, the lungs appeared normal. In the 
other five, the images were striking: Where 
there should have been bright oranges and 
yellows, signifying pulsing blood, one lung 
was nearly completely blue, indicating 
little flow. Buonsenso believes tiny blood 
clots or chronic damage to the lining of 


3712 adults, or 21% of its adult enrollment target of 17,680. Among 
children, numbers are even lower, with 98 children in a study aiming 
to enroll 19,500 of them. 


Critics note that other countries have been more nimble. By July 


ast Christmas to write 


th (NIH) in 2020, as 
blem. NIH had solicited 
before their January 
sions by late March. 
ear whether NIH will 
ight be a meaningful di- 


2021, the United Kingdom had funded 15 Long Covid research projects 
aimed at diagnosis and treatment. A recent independent review pub- 
lished by the Rockefeller Foundation found that, as of February, NIH 
had funded just eight of the 200 Long Covid trials listed in the database 
ClinicalTrials.gov; the trials are being run in many countries. 
NIH acknowledges the critiques and says it has already “obligated 
or committed” the $1.15 billion, slated to be spent over 4 years. But 
the scope of the project prevents it from sprinting, Walter Koroshetz, 
director of NIH’s National Institute of Neurological Disorders and 
Stroke and a co-chair of the RECOVER initiative, implied in a talk to 


agnostic biomarker for many types of Long Covid. “Maybe they should 
hire people who are dedicated to accelerating these programs,” says 
Putrino, who specializes in rehabilitation medicine. “[Long Covid] is a 
national crisis. This does not deserve to be somebody's second or third 
job. What we need from the NIH right now is their full attention.” 
Putrino’s is not the lone complaint about NIH’s management of 
Long Covid research—an initiative dubbed RECOVER, for Research- 
ing COVID to Enhance Recovery. RECOVER's flagship, an obser- 
vational study of up to 40,000 people, has come under fire from 
patient advocates and some scientists who say it lacks transparency 
and is moving far too slowly—a ponderous battleship when a fleet of 
hydroplanes is what's needed. As of 6 June, the study had signed up 
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a panel of advisers to the NIH director on 9 June. “You can't believe 
what a big lift this has been,” he told the advisers. RECOVER “is engi- 
neered to really not leave any stone unturned.” 

NIH added in a statement that it expects to announce winners of 
the long-awaited January funding within 2 weeks. And the agency says 
it has dedicated multiple staff to RECOVER, reinforced by other NIH 
experts and the outside firm Deloitte. But given the growing con- 
cern about Long Covid—one recent report estimated one in five U.S. 
COVID-19 survivors is afflicted—U.S. researchers say more urgency is 
needed (see main story, p. 1261). 

RECOVER's longitudinal study aims to discover the biological 
roots of Long Covid and describe its prevalence, risk factors, and 


science.org SCIENCE 


ILLUSTRATION: A. MASTIN/SCIENCE 


blood vessels may be impairing blood flow. 
He and his colleagues published their first 
evidence of such damage, in a 14-year- 
old girl, in July 2021 in The Lancet 
Child & Adolescent Health. 

The knotty question is what to 
do next. Buonsenso’s_ patients 
don’t meet the current criteria 
for taking anticoagulant drugs 
that prevent blood clots or keep 
them from getting bigger, be- 
cause no clots are visible on 
their scans. “You either say, ‘’m 
sorry, but I can’t do anything 
outside of a study, or you are 
kind of a not-scared doctor with 
reciprocal trust with the patient 
and family, and together make a de- 
cision,” Buonsenso says. 

With seriously ill children and 
their desperate parents in front of him, 
Buonsenso didn’t want to wait to launch a 
clinical trial. He and the families instead 
decided to do what they could. All five 


COVID-19 more than 2 years ago, she wasn’t 
hospitalized but months later struggled 
with fatigue, breathlessness when walk- 
ing, persistent fever, joint pain, and 
other symptoms. Yet bloodwork and 
other medical tests, such as x-rays 
of her lungs, were largely normal. 
Esperti kept pushing for answers. 
Then, in the spring of 2021, “I re- 
ceived a call from the pulmono- 
logist saying, ‘Listen, I want you 
to do another test?” 

Esperti was one of the first 
Long Covid patients in Europe to 
receive a lung SPECT-CT scan, and 

it looked just like the compromised 
lungs of the sick children Buonsenso 
scanned later. “My right lung is almost 
completely blue,’ she says. She posted 
the images on Twitter, where they were 
shared by thousands of people. 

Like Buonsenso, Esperti’s doctors believe 
damage to tiny blood vessels and minuscule 
clots are the culprit. The theory of persis- 


patients are now taking anticoagulants— 
which carry the risk of severe bleeding— 
under close monitoring. 

He’s heartened by their progress. Sev- 
eral are back in school, playing sports, and 
spending time with friends. Two have been 
rescanned. A teenager who still suffers from 
symptoms showed little improvement. But 
in another, whose symptoms have largely re- 
solved, the images looked nearly normal. The 
other three will be rescanned this summer. 


Of course, he can’t be sure those who 
improved did so because of the treatment. 
That’s why he is hoping for resources to 
establish a Long Covid clinical trial that 
would include a placebo group. 

Like other doctors, Buonsenso has been 
inspired by Long Covid patients who have 
lobbied hard for treatments and shared suc- 
cesses and failures on social media. Marta 
Esperti, a 34-year-old graduate student now 
living in Paris, is one of them. Struck by 


tent microclots gained further credence in 
the summer of 2021, when Resia Pretorius, 
a physiologist at Stellenbosch University in 
South Africa, and her colleagues reported 
in Cardiovascular Diabetology that such 
clots could linger in the blood of Long Co- 
vid patients. Her team found signs of exces- 
sive blood clotting in 11 people with Long 
Covid, but not in healthy people or another 
group with type 2 diabetes, whom they used 
for comparison. 


symptoms. It will also include clinical trials of treatments and pre- 
ventives, which NIH hopes to launch by fall. 

“Everybody is working as fast as possible but this is a monster of 
a study,” says immunologist Janko Nikolich-Zugich of the University 
of Arizona College of Medicine, Tucson, a principal investigator for a 
RECOVER arm in Arizona. “It has been a nightmare to both put together 
and to run ... in part because Long Covid comes in so many flavors.” 

Ayear ago NIH awarded New York University’s (NYU's) Grossman 
School of Medicine a huge chunk—$448 million—of its Long Covid fund- 
ing to run the longitudinal study, through subawards NYU made to more 
than 24 institutions. NYU declined to make RECOVER investigators 
available for interviews or respond to written questions about the study. 

Meanwhile, extramural scientists not involved in RECOVER are 
finding other ways to fund research, because NIH has put out few 
new requests for applications since it first launched the initiative. 
Michael VanElzakker, a neuroscientist at Massachusetts General 
Hospital, was already using a rare, sophisticated brain scanner to 
run a battery of tests on cognitive function in people with myalgic 
encephalomyelitis/chronic fatigue syndrome when the pandemic 
descended. As a result, he says, “| have a whole neuroimaging 
pipeline that’s ripe for Long Covid people to go through.” But he’s 
relying on charitable donations, because “there's not really a way 
to apply for [NIH] Long Covid funding per se.” He recently applied 
for a general NIH neuroscience grant but worries his proposal won't 
fare well when “binned in with somebody who's got a model of ALS 
[amyotrophic lateral sclerosis].” 
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Others complain about NIH’s opaqueness. For instance, the RECOVER 
website does not list award amounts to institutions subcontracted 
through NYU, or tally how much has been spent. “| wish there was more 
transparency regarding instruments, enrollment, data, and where and 
when and with whom the money has been invested and what it has 
yielded so far,’ says Harlan Krumholz, a cardiologist at the Yale School of 
Medicine. He co-authored Rockefeller’s policy briefing, which excoriated 
NIH's Long Covid response. NIH says it is working to add grant funding 
and other details to its searchable public database, REPORTER. 

Still others remain frustrated by NIH'’s tight application deadlines. For 
example, on 27 April the agency announced a competition for funding 
for Long Covid clinical trials—with a preferred submission deadline of 
19 May. “| decided not to apply because the deadline” was so short, one 
prominent immunologist says. 

But Nikolich-Zugich says the agency is doing the best it can. “I have 
seen an incredible number of people from the NIH work incredibly 
hard on all of this on very compressed timelines,” he says. 

Mady Hornig, a neuropsychiatrist at Columbia University who has 
Long Covid and is a patient representative in RECOVER, notes NIH 
must balance moving quickly with preserving rigor. The goal of the fast 
funding turnarounds “is a really laudable one: not to be slowed down 
by the usual NIH cycle and still allow for high-quality peer review,’ she 
says. Still, she agrees that when it comes to funding research on this 
mysterious condition, “There have been a few hiccups along the way.” 

Koroshetz said last week that NIH is doing its all. “We're hoping we 
can ... really make a difference to people. And the sooner the better.” 
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One challenge facing clinicians keen to 
hunt for microclots in blood is that detect- 
ing them is a laborious process, although 
Pretorius is working to make it more accessi- 
ble to doctors. SPECT-CT scans identify clots 
indirectly, based on blood flow abnormalities. 

The work by Pretorius and others fits 
an emerging pattern in which the biology 
of acute and chronic COVID-19 appears 
to overlap. Early in the pandemic, doctors 
recognized blood clots as a signature of 
early, severe illness: Many hospitalized pa- 
tients had clots in their lungs, brain, and 
elsewhere. Even people with milder disease 
were at heightened risk of heart attacks and 
strokes in the weeks following infection. 

“The two diseases”—acute COVID-19 and 
Long Covid—“aren’t very different,” posits 
David Lee, an emergency medicine doctor 
at New York University Grossman School 
of Medicine. He suspects microclots ex- 
plain many chronic symptoms. At least 70% 
of Long Covid patients have respiratory 
problems, he estimates, and at least 30% 
suffer from dysautonomia, in which ab- 
normalities in the autonomic nervous 
system disrupt heart rate, breathing, 
and digestive function. Many suf- 
fer from fatigue and what’s often 
called “brain fog.” Tiny clots in 
the brain could explain cognitive 
troubles, Lee notes; or clots may 
kill small fiber nerve cells and 
drive dysautonomia. 

But solid evidence that micro- 
clots cause Long Covid symptoms 
is still lacking. To learn more, Lee 
launched a study of 20 Long Covid 
patients with respiratory symptoms 
and 20 healthy volunteers who had 
COVID-19 and recovered. The partici- 
pants will undergo SPECT-CT scans to see 
whether any of them have disrupted lungs. 
If abnormalities on the scans are unique to 
some with Long Covid, that could be a step 
toward identifying subsets of patients. 

“T oscillate in my excitement” for dif- 
ferent Long Covid theories, says David 
Putrino, a neurophysiologist and reha- 
bilitation medicine specialist at the Icahn 
School of Medicine at Mount Sinai. He is 
studying several possible mechanisms for 
why symptoms can linger or even worsen 
after an infection. Right now, his enthusi- 
asm for the microclot hypothesis is run- 
ning high, thanks in part to a collaboration 
with Pretorius and others; that team has 
preliminary data to suggest the amount of 
microclots in blood correlates with the se- 
verity of some Long Covid symptoms, such 
as cognitive deficits. 

Putrino is now running a small study 
of Long Covid patients with microclots in 
blood plasma, testing whether apheresis, 


1264 17 JUNE 2022 + VOL 376 ISSUE 6599 


which filters blood and reinfuses it, im- 
proves their symptoms. Any results, though, 
will be “very preliminary,’ he cautions. 
Apheresis can “filter out lots of things in 
the blood” that also might fuel symptoms. 


PERSISTENT VIRUS 

The idea that lingering virus might be a 
Long Covid culprit solidified for Proal in 
late 2021, thanks in part to an autopsy study 
that captured her attention. Researchers 
from the U.S. National Institutes of Health 
described in a preprint their analysis of tis- 
sues from 44 people who had been infected. 
Most had died from COVID-19 but five had 
mild or even asymptomatic infection and 
had succumbed to something else. All 44 still 
harbored viral RNA in their body, including 
in the brain, muscle, gut, and lungs. Many 
organs also had evidence of replicating virus. 


The study established that replicating 
copies of the virus could persist. But it 
didn’t focus on people with Long Covid, 
making it difficult to establish a link. 

Other studies are now trying to do that 
by comparing those with Long Covid and 
other cohorts, including people who have 
recovered from COVID-19. In one, gastro- 
enterologist Herbert Tilg at the Medical 
University of Innsbruck set out to look for 
molecular traces of virus in the gut, a favor- 
ite hunting ground for researchers study- 
ing SARS-CoV-2. That’s because the gut is 
far easier for doctors to access than many 
other organs, such as the lungs or the brain, 
and is thought likelier to harbor virus than 
blood, which tends to clear a pathogen 
more quickly. 


Tilg recruited 46 people who'd had 
COVID-19 months earlier, 21 of whom had 
at least one symptom of Long Covid. All the 
volunteers had inflammatory bowel disease 
and were scheduled for routine endoscopies. 
(Most were in remission.) Tilg used that pro- 
cedure to gather extra tissue samples and 
probe for signs of the coronavirus. The bot- 
tom line: All of those with Long Covid symp- 
toms harbored viral RNA or, in some cases, 
viral proteins. Among those who'd recovered 
from acute COVID-19, 11 had traces of virus, 
too, whereas another 14 had no virus that 
Tilg could find. He and colleagues published 
their study in May in Gastroenterology. 

“Our paper is a clinical observation,” he 
stresses, and doesn’t prove that lingering 
virus is harming people. That said, Tilg was 
surprised by the presence of persistent vi- 
rus in people with Long Covid symptoms, 
and its absence in many previously infected 
people who felt fine. 

Now, researchers like Tilg want to under- 
stand how the small amounts of virus that 
linger in tissue reserves such as the gut 

behave. Tilg and Saurabh Mehandru, a 
gastroenterologist and immunologist 

at Mount Sinai who was among the 

first to find signs of persistent vi- 

rus in COVID-19 survivors, haven’t 
been able to culture the virus 
from gut tissue in the lab—which 
would confirm that the virus is 
replicating. But such culturing 
can be tough, even in acute cases 
of COVID-19. 

A key question is whether linger- 

ing virus is driving illness in Long Co- 

vid patients. Proal and her colleagues 
are trying to definitively tie persistent 
virus to Long Covid symptoms—or de- 
termine that the theory doesn’t hold wa- 
ter. She is working with Mehandru and with 
Putrino, who has access to a cohort of more 
than 1000 Long Covid patients who could 
participate in new studies. The team is plan- 
ning colonoscopies to look for virus in intes- 
tinal cells, and also wants to scrutinize the 
activity of immune cells that populate the 
region. Are these cells in a heightened state 
of alert, for example, suggesting the viral par- 
ticles are setting them off? In some patients, 
Mehandru has noted immune responses in 
the gut, but says the work is preliminary. 

Ideally, this deep dive into the gut’s 
biology would be paired with a clinical trial 
examining whether antiviral drugs can beat 
back Long Covid symptoms. Participants 
could be studied before and after a course 
of antivirals, to see whether they clear any 
signs of virus in their gut and whether that 
matches up with changes in symptoms. 
“That’s the trial I’d love to see happen,’ 
Putrino says. 
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So far it hasn’t, to the frustration of many 
in the Long Covid research and advocacy 
communities. “The patients are really, really 
desperate,” says Francisco Tejerina, an HIV 
researcher at Gregorio Marafion Hospital 
in Madrid, who has detected viral RNA in 
the stool, urine, and blood plasma of people 
with Long Covid. He worries about people 
experimenting on their own with therapies, 
including antivirals, that may be useless, or 
worse, dangerous. “They want something 
now,’ he says. “I understand that.” 


HAYWIRE IMMUNE SYSTEM 

As the pursuit of blood clots and viral frag- 
ments intensifies, a third track is both in- 
tersecting with those themes and making 
inroads on its own: the immune system, 
which some researchers believe could help 
bind together disparate observations. One 
idea, Phetsouphanh says, is that in some 
COVID-19 patients an immune system 
revved up and destabilized by the corona- 
virus attack may be unable to reset itself 
to idle. 

To test that, he and colleagues gathered 
blood from 31 Long Covid patients, who all 
had fatigue or other characteristic symp- 
toms at least 3 months after infection, 
and analyzed dozens of immune markers. 
“We were uncertain what we would find,’ 
Phetsouphanh says. The result was striking. 
Essentially, Long Covid patients had an im- 
mune system in constant high alert, the team 
reported in January in Nature Immunology. 

White blood cells that typically recruit 
other cells to sites of infection were highly 
activated, which may explain why the pa- 
tients’ levels of interferons, proteins the 
body makes to fight invaders, were sky high 
8 months after infection. The participants 
also had a dearth of inactivated T cells and B 
cells, a population of cells that normally put- 
ters about awaiting instruction to counter 
pathogens. Collectively this signaled chronic 
inflammation, which can cause a host of 
health problems. 

Phetsouphanh and his colleagues also 
found these immune signatures were 
unique to their Long Covid cohort: They 
didn’t appear in people who’d had the vi- 
rus but recovered, or in those who’d been 
infected with different coronaviruses. 

Other groups are also finding myriad 
immune system abnormalities long after a 
SARS-CoV-2 infection. In January, immuno- 
logist Akiko Iwasaki at the Yale School of 
Medicine, neuroscientist Michelle Monje 
at Stanford Medicine, and their colleagues 
reported in a preprint that mice given the 
coronavirus in their nose to mimic a mild 
infection developed inflammation in the 
brain, as well as loss of myelin, which insu- 
lates nerves and helps them transmit signals. 
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In the mice, inflammation 
struck the brain even though 
the researchers couldn’t 
find virus there. Further- 
more, when the group com- 
pared blood samples from 
48 people with Long Covid 
who had cognitive impair- 
ments and 15 Long Covid 
patients who did not, they 
found higher levels of in- 
flammatory markers in 
the first group, suggesting 
chronic inflammation may 
be driving these neurological 
symptoms. 

Such work highlights “how 
little we know about immune 
function,” says Putrino, who is col- 
laborating with Iwasaki. Work like 
hers is “opening up this world” of 
immune markers that aren’t part of stan- 
dard blood tests but may be critical to un- 
derstanding the syndrome. 

Increasingly, researchers want to fine- 
tune how they classify people with Long 
Covid, differentiating subsets based on 
symptoms, biology, or both. In a way, “the 
biggest obstacle that we are facing is we 
gave it one name, we gave it the name of 
Long Covid, which implies that it is one 
disease,” says Chahinda Ghossein, a phy- 
sician and heart disease researcher at 
Maastricht University and co-leader of a 
15,000-patient Long Covid study in the 
Netherlands. “All the studies being per- 
formed show us that it is not.” 

One of the first large efforts to define 
subgroups, in part by testing potential 
treatments, will begin to enroll patients 
this month in the United Kingdom. Called 
STIMULATE-ICP (Symptoms, Trajectory, 
Inequalities and Management: Under- 
standing Long-COVID to Address and 
Transform Existing Integrated Care Path- 
ways), it will recruit 4500 Long Covid 
patients. Each will be randomly assigned 
one of three potential therapies: an anti- 
clotting drug called rivaroxaban, an anti- 
inflammatory called colchicine, or a pair 
of antihistamines, famotidine and lorata- 
dine. (Antihistamines can quell a type of 
inflammation called mast-cell activation.) 
The project will include imaging as well as 
the collection of blood and tissue samples for 
a biobank. Organizers also have the option 
of adding up to three additional treatments. 

Amitava Banerjee, a cardiologist at Uni- 
versity College London, leads the study, 
and his colleague Emma Wall at UCL and 
the Francis Crick Institute is overseeing 
the treatment element. Both say it’s crucial 
to test Long Covid therapies in trials before 
prescribing them en masse to patients. 


“If you follow social media, you might 
think we’ve cracked it,” Banerjee says, refer- 
ring to anecdotal reports of vastly improved 
Long Covid symptoms, after, say, taking 
anticoagulants or antiviral drugs. Those 
narratives can offer important clues, he 
says, stressing that working with and listen- 
ing to patients is invaluable. (Every compo- 
nent of STIMULATE-ICP includes a patient 
advocate.) But, “We’ve got to be really hum- 
ble” about what we know, Banerjee adds. 

After months of delays, the trial is “close 
to the start line,” he says, but “we should 
have been there months ago.” Trials for 
drugs to treat acute COVID-19 were fast- 
tracked, while “Long Covid is joining the 
queue with everything else” in medicine, 
Banerjee laments. 

Esperti’s story underscores the work to 
be done. She’s been taking anticoagulant 
drugs for more than a year, and can now 
work, swim, drive, and cook—all previ- 
ously impossible tasks. But she still can’t 
run or carry heavy objects, and she suffers 
from sporadic fatigue. She was hit by a sec- 
ond bout of COVID-19 in December 2021 
that left her severely debilitated, but she 
improved after a fourth dose of vaccine in 
late March. 

Mysteries remain. A repeat SPECT-CT 
scan 3 months after she started taking 
the drugs showed no change in her lungs, 
though her lung function has improved. 
“T thought all I have to do is take anti- 
coagulants” and recovery would be quick, 
she says. Esperti suspects that, like peel- 
ing the layers of an onion, she’s unrav- 
eled one trigger for her symptoms only to 
come upon another she can’t yet decode. 
It’s another chapter in the annals of Long 
Covid—a narrative researchers are dog- 
gedly trying to change. & 
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toms? “I can’t say this is the truth, of course. 
But this makes sense.” 

Meanwhile, in the United States, micro- 
biologist Amy Proal can’t stop thinking 
about a second leading Long Covid theory: 
that the coronavirus keeps hurting people by 
stubbornly enduring in the body, even after 
acute infection passes. Studies have shown 
“the virus is capable of persistence in a wide 
range of body sites,’ especially nerves and 
other tissues, says Proal, who works at the 
PolyBio Research Foundation, a nonprofit 
in Washington state. She recently caught 
COVID-19 for the third time. 

Down under in Australia, immunologist 
Chansavath Phetsouphanh of the Univer- 
sity of New South Wales, Sydney, is chasing 
a third lead, motivated by what the blood 
of Long Covid patients has divulged: an im- 
mune system gone haywire even 8 months 
after they'd first tested positive. He had 
assumed that immune cells galvanized 
to fight off infection would have calmed 
down over that time span. So, “It was a sur- 
prise that these cells did not recover,’ says 
Phetsouphanh, who is working to set up an 
international Long Covid collaboration. 

For each of these researchers—and many 
others exploring the causes of Long Covid— 
untangling the complex syndrome, with 
a still-evolving definition, is a laborious, 
step-wise process. First, they must show 
that a possible contributor—such as mi- 
nuscule clots, lingering virus, or immune 
abnormalities—crops up disproportionately 


Glacial pace for U.S. Long Covid grants 


By Meredith Wadman 


avid Putrino, a neurophysiologist at the Icahn School of Medicine 


at Mount Sinai, labored through his holiday 


a grant application for urgently needed Long Covid research. With 
colleagues, he hoped to tap into $1.15 billion in funding that Con- 


gress granted the National Institutes of Hea 
Long Covid emerged as a major public health pro 
grant applications in December 2021, just weeks 


due date. The agency said it planned to issue deci 


But as of 10 June, Putrino was still waiting to h 
fund his effort to discover whether microclots mi 


in people with Long Covid. Then comes the 
hard part: proving that each of these traits, 
alone or in combination, explains why the 
coronavirus has rendered millions of people 
shadows of their former selves. 

All agree that solo operators are unlikely. 
Lingering virus, for example, could attack 
the circulatory system, triggering blood 
clots or chronic inflammation. “I see this as 
a triangle,” Buonsenso says, with each trig- 
ger potentially explaining, or even amplify- 
ing, the others. 

A final challenge is identifying treatments 
that ease or reverse these abnormalities 


“I consider Long Covid to be 
a massive emergency.” 


Amy Proal, PolyBio Research Foundation 


and help patients feel better. In the United 
Kingdom, home to a widely praised effort to 
identify immediate COVID-19 treatments, 
researchers are launching a clinical trial 
that will be among the largest worldwide 
to test potential Long Covid therapies in 
a randomized, statistically robust man- 
ner. But more such studies are needed— 
and time is of the essence (see sidebar, 
below). In May, the U.S. Centers for Dis- 
ease Control and Prevention reported that 
a review of the medical records of nearly 
2 million people suggested at least one in 
five of those diagnosed with COVID-19 had 


developed conditions characteristic of Long 
Covid. Other studies have found roughly 
similar rates. Some recent research suggests 
the risk for vaccinated people is somewhat 
lower, but vaccination’s power to head off 
the syndrome remains uncertain. 

For Proal and others, fitting the puzzle 
pieces together is of urgent concern. “I 
consider Long Covid to be a massive emer- 
gency,” she says. 


TINY BLOOD CLOTS 

In Rome, Buonsenso is using a sophisti- 
cated medical imaging technique to better 
understand the role of blood clots. Called a 
SPECT-CT scan, it combines two different 
types of images: a single-photon emission 
computed tomography scan, which uses a 
radiotracer injected into a person’s veins to 
provide pictures of blood flow, and a stan- 
dard CT scan for information about lung 
structure. By merging these, doctors can see 
which parts of the lungs aren’t getting nor- 
mal blood flow. 

So far, Buonsenso has scanned the lungs 
of 11 youngsters who have severe Long 
Covid, including irregular heartbeats and 
breathlessness during exercise. In six chil- 
dren, the lungs appeared normal. In the 
other five, the images were striking: Where 
there should have been bright oranges and 
yellows, signifying pulsing blood, one lung 
was nearly completely blue, indicating 
little flow. Buonsenso believes tiny blood 
clots or chronic damage to the lining of 


3712 adults, or 21% of its adult enrollment target of 17,680. Among 
children, numbers are even lower, with 98 children in a study aiming 
to enroll 19,500 of them. 


Critics note that other countries have been more nimble. By July 


ast Christmas to write 


th (NIH) in 2020, as 
blem. NIH had solicited 
before their January 
sions by late March. 
ear whether NIH will 
ight be a meaningful di- 


2021, the United Kingdom had funded 15 Long Covid research projects 
aimed at diagnosis and treatment. A recent independent review pub- 
lished by the Rockefeller Foundation found that, as of February, NIH 
had funded just eight of the 200 Long Covid trials listed in the database 
ClinicalTrials.gov; the trials are being run in many countries. 
NIH acknowledges the critiques and says it has already “obligated 
or committed” the $1.15 billion, slated to be spent over 4 years. But 
the scope of the project prevents it from sprinting, Walter Koroshetz, 
director of NIH’s National Institute of Neurological Disorders and 
Stroke and a co-chair of the RECOVER initiative, implied in a talk to 


agnostic biomarker for many types of Long Covid. “Maybe they should 
hire people who are dedicated to accelerating these programs,” says 
Putrino, who specializes in rehabilitation medicine. “[Long Covid] is a 
national crisis. This does not deserve to be somebody's second or third 
job. What we need from the NIH right now is their full attention.” 
Putrino’s is not the lone complaint about NIH’s management of 
Long Covid research—an initiative dubbed RECOVER, for Research- 
ing COVID to Enhance Recovery. RECOVER's flagship, an obser- 
vational study of up to 40,000 people, has come under fire from 
patient advocates and some scientists who say it lacks transparency 
and is moving far too slowly—a ponderous battleship when a fleet of 
hydroplanes is what's needed. As of 6 June, the study had signed up 
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a panel of advisers to the NIH director on 9 June. “You can't believe 
what a big lift this has been,” he told the advisers. RECOVER “is engi- 
neered to really not leave any stone unturned.” 

NIH added in a statement that it expects to announce winners of 
the long-awaited January funding within 2 weeks. And the agency says 
it has dedicated multiple staff to RECOVER, reinforced by other NIH 
experts and the outside firm Deloitte. But given the growing con- 
cern about Long Covid—one recent report estimated one in five U.S. 
COVID-19 survivors is afflicted—U.S. researchers say more urgency is 
needed (see main story, p. 1261). 

RECOVER's longitudinal study aims to discover the biological 
roots of Long Covid and describe its prevalence, risk factors, and 
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blood vessels may be impairing blood flow. 
He and his colleagues published their first 
evidence of such damage, in a 14-year- 
old girl, in July 2021 in The Lancet 
Child & Adolescent Health. 

The knotty question is what to 
do next. Buonsenso’s_ patients 
don’t meet the current criteria 
for taking anticoagulant drugs 
that prevent blood clots or keep 
them from getting bigger, be- 
cause no clots are visible on 
their scans. “You either say, ‘’m 
sorry, but I can’t do anything 
outside of a study, or you are 
kind of a not-scared doctor with 
reciprocal trust with the patient 
and family, and together make a de- 
cision,” Buonsenso says. 

With seriously ill children and 
their desperate parents in front of him, 
Buonsenso didn’t want to wait to launch a 
clinical trial. He and the families instead 
decided to do what they could. All five 


COVID-19 more than 2 years ago, she wasn’t 
hospitalized but months later struggled 
with fatigue, breathlessness when walk- 
ing, persistent fever, joint pain, and 
other symptoms. Yet bloodwork and 
other medical tests, such as x-rays 
of her lungs, were largely normal. 
Esperti kept pushing for answers. 
Then, in the spring of 2021, “I re- 
ceived a call from the pulmono- 
logist saying, ‘Listen, I want you 
to do another test?” 

Esperti was one of the first 
Long Covid patients in Europe to 
receive a lung SPECT-CT scan, and 

it looked just like the compromised 
lungs of the sick children Buonsenso 
scanned later. “My right lung is almost 
completely blue,’ she says. She posted 
the images on Twitter, where they were 
shared by thousands of people. 

Like Buonsenso, Esperti’s doctors believe 
damage to tiny blood vessels and minuscule 
clots are the culprit. The theory of persis- 


patients are now taking anticoagulants— 
which carry the risk of severe bleeding— 
under close monitoring. 

He’s heartened by their progress. Sev- 
eral are back in school, playing sports, and 
spending time with friends. Two have been 
rescanned. A teenager who still suffers from 
symptoms showed little improvement. But 
in another, whose symptoms have largely re- 
solved, the images looked nearly normal. The 
other three will be rescanned this summer. 


Of course, he can’t be sure those who 
improved did so because of the treatment. 
That’s why he is hoping for resources to 
establish a Long Covid clinical trial that 
would include a placebo group. 

Like other doctors, Buonsenso has been 
inspired by Long Covid patients who have 
lobbied hard for treatments and shared suc- 
cesses and failures on social media. Marta 
Esperti, a 34-year-old graduate student now 
living in Paris, is one of them. Struck by 


tent microclots gained further credence in 
the summer of 2021, when Resia Pretorius, 
a physiologist at Stellenbosch University in 
South Africa, and her colleagues reported 
in Cardiovascular Diabetology that such 
clots could linger in the blood of Long Co- 
vid patients. Her team found signs of exces- 
sive blood clotting in 11 people with Long 
Covid, but not in healthy people or another 
group with type 2 diabetes, whom they used 
for comparison. 


symptoms. It will also include clinical trials of treatments and pre- 
ventives, which NIH hopes to launch by fall. 

“Everybody is working as fast as possible but this is a monster of 
a study,” says immunologist Janko Nikolich-Zugich of the University 
of Arizona College of Medicine, Tucson, a principal investigator for a 
RECOVER arm in Arizona. “It has been a nightmare to both put together 
and to run ... in part because Long Covid comes in so many flavors.” 

Ayear ago NIH awarded New York University’s (NYU's) Grossman 
School of Medicine a huge chunk—$448 million—of its Long Covid fund- 
ing to run the longitudinal study, through subawards NYU made to more 
than 24 institutions. NYU declined to make RECOVER investigators 
available for interviews or respond to written questions about the study. 

Meanwhile, extramural scientists not involved in RECOVER are 
finding other ways to fund research, because NIH has put out few 
new requests for applications since it first launched the initiative. 
Michael VanElzakker, a neuroscientist at Massachusetts General 
Hospital, was already using a rare, sophisticated brain scanner to 
run a battery of tests on cognitive function in people with myalgic 
encephalomyelitis/chronic fatigue syndrome when the pandemic 
descended. As a result, he says, “| have a whole neuroimaging 
pipeline that’s ripe for Long Covid people to go through.” But he’s 
relying on charitable donations, because “there's not really a way 
to apply for [NIH] Long Covid funding per se.” He recently applied 
for a general NIH neuroscience grant but worries his proposal won't 
fare well when “binned in with somebody who's got a model of ALS 
[amyotrophic lateral sclerosis].” 
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Others complain about NIH’s opaqueness. For instance, the RECOVER 
website does not list award amounts to institutions subcontracted 
through NYU, or tally how much has been spent. “| wish there was more 
transparency regarding instruments, enrollment, data, and where and 
when and with whom the money has been invested and what it has 
yielded so far,’ says Harlan Krumholz, a cardiologist at the Yale School of 
Medicine. He co-authored Rockefeller’s policy briefing, which excoriated 
NIH's Long Covid response. NIH says it is working to add grant funding 
and other details to its searchable public database, REPORTER. 

Still others remain frustrated by NIH'’s tight application deadlines. For 
example, on 27 April the agency announced a competition for funding 
for Long Covid clinical trials—with a preferred submission deadline of 
19 May. “| decided not to apply because the deadline” was so short, one 
prominent immunologist says. 

But Nikolich-Zugich says the agency is doing the best it can. “I have 
seen an incredible number of people from the NIH work incredibly 
hard on all of this on very compressed timelines,” he says. 

Mady Hornig, a neuropsychiatrist at Columbia University who has 
Long Covid and is a patient representative in RECOVER, notes NIH 
must balance moving quickly with preserving rigor. The goal of the fast 
funding turnarounds “is a really laudable one: not to be slowed down 
by the usual NIH cycle and still allow for high-quality peer review,’ she 
says. Still, she agrees that when it comes to funding research on this 
mysterious condition, “There have been a few hiccups along the way.” 

Koroshetz said last week that NIH is doing its all. “We're hoping we 
can ... really make a difference to people. And the sooner the better.” 
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Oil at sea—how much 
is too much? 


The anthropogenic share of 


marine oil discharge is much larger 
than previously thought 


a al 
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By Ira Leifer 


atural oil seepage from seafloor pe- 
troleum hydrocarbon reservoirs is 
a chemical stress on marine life (1). 
It also is a chemosynthetic energy 
source that allows the existence of 
chemosynthetic lifeforms and com- 
munities. These hydrocarbon emissions 
drive ecological adaptations and support 
high biomass (2). Additional oil enters the 
ocean from accidental and operational re- 
leases and primarily manifests as chronic 
and transient slicks found along shipping 
lanes and in coastal waters (3). Trends for 
small oil releases have been unclear be- 
cause of the paucity of monitoring methods. 
Satellite synthetic aperture radar (SAR) is 
highly promising for monitoring small 
spills and oil seepage slicks. On page 1300 
of this issue, Dong et al. (3) find that oil 
slicks are far more widespread than prior 
estimates (J). They analyzed more than 
500,000 satellite SAR images from 2014 to 
2019 and found over 450,000 oil slicks, with 
the vast majority originating from anthro- 
pogenic releases or discharges. 
Slicks were found over hotspots associ- 
ated with natural marine seepage, offshore 
oil production, and shipping lanes. Dong 
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et al. found that natural seeps contributed 
6% in 2014 to 2019, whereas anthropogenic 
discharges, such as those from ships, oil 
platforms, and pipelines, contributed 94%. 
This finding was a notable change com- 
pared to the roughly 50/50 split during 
the 1990s (1). This change likely resulted 
in part from shipping expansion during 
recent decades, particularly in developing 
countries’ waters. This latest assessment of 
the marine anthropogenic oil budget im- 
plies substantial, yet poorly characterized, 
ecological implications. 

Insights into the ecological implications 
of anthropogenic marine oil emissions are 
provided by natural marine oil seepage 
studies (4). The resulting ecosystem adap- 
tations depend on the hydrocarbon flux, 
persistence, and composition. These vary 
spatially and alter the community species 
composition based on the balance between 
toxicity and bioavailable energy. For ex- 
ample, the decrease in seep hydrocarbon 
concentration with distance from the sea- 
bed source correlates to community com- 
position shifts along this gradient (5). Most 
seep ecosystem studies focus on deepsea 
systems far below the photic zone (6). Oil 
seepage also affects shallow communities 
where there is competition with species that 
utilize sunlight-derived energy (2, 4). Given 
that most anthropogenic oil slicks occur 
in shallow coastal waters (3), improving 


understanding of shallow seep ecosystems 
is highly relevant. 

Seepage ecosystem studies largely have fo- 
cused on the lowest chemosynthetic trophic 
level of the food chain, microbes (6). The 
seep microbiome supports the next trophic 
food chain level—namely, the sediment 
seep communities, which include copepods, 
nematodes, foraminifera, and higher trophic 
levels of macrofauna, such as tubeworms, 
bivalves, and sponges. These species domi- 
nate the seep community and support pred- 
atory and mobile feeders such as shrimp, 
crabs, and fish (6). Seep communities have 
high-density biomass compared to the sur- 
rounding seabed, albeit at reduced species 
diversity (7). Still, solely seep-dependent 
species are constrained to the vicinity of the 
area of active seepage (8); their presence is 
a strong indicator of natural seepage (5). 
These seep communities could play a critical 
role in the survival of opportunistic mobile 
predators, such as during periods of food 
scarcity—similar to “watering holes” during 
the dry season in the African savannah (2). 
Researchers also have proposed that seep- 
age plays a role in the food needs of whales 
in the Gulf of Mexico (2). 

The seep community specializes or adapts 
according to their surroundings. For exam- 
ple, seep communities in the deep sea often 
have seep specialist organisms that leverage 
the ecosystem niches created by the seep 
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bioavailable energy and the toxicity of the 
seep petroleum. Such seep specialists com- 
pete poorly against nonspecialists outside 
the seep environs and against species that 
opportunistically use both chemosynthetic- 
derived and Sun-derived energy at shallow 
seepage sites (9). 

As the already dominant anthropogenic 
petroleum footprint continues expanding, 
these chemosynthetic niches grow at the ex- 
pense of endangered and protected species 
and ecosystems. This threat occurs both di- 
rectly by toxicity and indirectly by ecosystem 
adaptations that shift species composition— 
as is observed in seep communities. Such 
shifts affect the food supply of predators such 
as fish and crabs, which also are under stress 
from non-oil pollution and factors like ocean 
acidification and warming oceans. 

In response, laws, regulations, and poli- 
cies globally seek to minimize anthropo- 
genic releases (10). Mitigation falls into two 
distinct approaches: proactively—through 
regulation; and reactively—through oil 
spill response and penalties. The latter’s 
success is mixed at best for large offshore 
spills (2), with mitigation approaches 
having negative environmental impacts 
that should be assessed against natural 
mitigation (12). However, societal concerns 
inevitably also influence response deci- 
sions (and regulation development and 
implementation). 
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After analyzing >500,000 satellite images of global 
oceans, Dong et al. found more than 450,000 oil slicks, 
which cover a much larger area than prior estimates, 
with more than 90% coming from human activities. 


Dong et al. report that connecting slicks 
to shipborne sources was difficult to infea- 
sible—a critical factor in effective regulatory 
enforcement. Dense SAR coverage of coastal 
waters, as implemented by CleanSeaNet for 
Europe, combined with ship time-history and 
back-trajectory modeling, can connect slicks 
to their sources. CleanSeaNet spill detection 
is backed up by marine and airborne patrol 
dispatch within 3 hours (13). According to 
some estimates (J4), North Sea bird oilings 
decreased by up to an order of magnitude 
during the 2000s, depending on species. This 
encouraging decrease corresponded to the 
program’s implementation. 

Successful regulations must shift the bal- 
ance from the operational cost of pollution 
toward compliance costs. Effective enforce- 
ment is key. Penalties and enforcement 
likelihood need to exceed the maintenance 
and retrofit costs to incentivize against op- 
erational oil releases. Moreover, the develop- 
ment of effective regulation requires mean- 
ingful progress metrics, which currently are 
largely absent. Here, satellite SAR provides 
a globally available tool to support environ- 
mental management and spill response to 
reduce anthropogenic oil and its impacts. 

The current understanding of the ecologi- 
cal impacts of the typical small marine oil 
releases is poor. Given the overlap, this high- 
lights the need for studies of shallow, coastal 
seepage ecosystems where most releases oc- 
cur. Marine ecosystems face numerous grow- 
ing and synergistic threats. Thus, protecting 
chemosynthetic and nonchemosynthetic eco- 
systems requires an improved understanding 
of their resiliency and fragility. m 
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ECOLOGY AND EVOLUTION 


A new polar 
bear population 


Can the international 
conservation agreement 
protect these bears? 


By Elizabeth Peacock 


n 1965, in Fairbanks, Alaska, biolo- 

gists from around the Circumpolar 

North convened the First International 

Scientific Meeting on the Polar Bear (J). 

Scientists from the USSR, the United 

States, Denmark, Norway, and Canada 
gathered in urgency, in the midst of the Cold 
War, to strategize how to protect the polar 
bear, whose populations were declining rap- 
idly because of hunting for sport. This land- 
mark meeting led to the 1973 international 
Agreement on the Conservation of Polar 
Bears (hereafter, the Agreement), one of the 
first international wildlife agreements and 
the first political agreement among Arctic 
nations. The Agreement promised to pro- 
tect polar bears from unregulated harvest 
and to collaborate on a global scale on re- 
search and conservation (J). On page 1333 
of this issue, Laidre et al. (2) testify to the 
Agreement and the enduring hope of those 
scientists who met in Fairbanks in a para- 
mount global collaboration. 

The original Agreement was a resound- 
ing success, resulting both in the rebound 
of polar bear populations from the hunt- 
ing of the mid-20th century [e.g., (3)] and 
in global scientific collaboration [e.g., (4)]. 
The Polar Bear Specialist Group, now under 
the auspices of the International Union for 
Conservation of Nature (J), continues to 
meet every 3 to 4 years to give updates on 
the research and management of the global 
populations of polar bears. 

Through Laidre et al’s collection of data in 
difficult field conditions in a remote corner of 
the globe and through analysis of 37 years of 
data from the circumpolar Arctic collected by 
hundreds of technicians and biologists, often 
at risk to their own lives, a new and distinc- 
tive polar bear population—the 20th—has 
been identified in southeast Greenland (see 
the photo). Laidre et al. detail the character- 
istics of this population of polar bears, which 
diverged from other populations just a few 
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bioavailable energy and the toxicity of the 
seep petroleum. Such seep specialists com- 
pete poorly against nonspecialists outside 
the seep environs and against species that 
opportunistically use both chemosynthetic- 
derived and Sun-derived energy at shallow 
seepage sites (9). 

As the already dominant anthropogenic 
petroleum footprint continues expanding, 
these chemosynthetic niches grow at the ex- 
pense of endangered and protected species 
and ecosystems. This threat occurs both di- 
rectly by toxicity and indirectly by ecosystem 
adaptations that shift species composition— 
as is observed in seep communities. Such 
shifts affect the food supply of predators such 
as fish and crabs, which also are under stress 
from non-oil pollution and factors like ocean 
acidification and warming oceans. 

In response, laws, regulations, and poli- 
cies globally seek to minimize anthropo- 
genic releases (10). Mitigation falls into two 
distinct approaches: proactively—through 
regulation; and reactively—through oil 
spill response and penalties. The latter’s 
success is mixed at best for large offshore 
spills (2), with mitigation approaches 
having negative environmental impacts 
that should be assessed against natural 
mitigation (12). However, societal concerns 
inevitably also influence response deci- 
sions (and regulation development and 
implementation). 
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a globally available tool to support environ- 
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reduce anthropogenic oil and its impacts. 

The current understanding of the ecologi- 
cal impacts of the typical small marine oil 
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gists from around the Circumpolar 

North convened the First International 

Scientific Meeting on the Polar Bear (J). 
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gathered in urgency, in the midst of the Cold 
War, to strategize how to protect the polar 
bear, whose populations were declining rap- 
idly because of hunting for sport. This land- 
mark meeting led to the 1973 international 
Agreement on the Conservation of Polar 
Bears (hereafter, the Agreement), one of the 
first international wildlife agreements and 
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nations. The Agreement promised to pro- 
tect polar bears from unregulated harvest 
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Agreement and the enduring hope of those 
scientists who met in Fairbanks in a para- 
mount global collaboration. 
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ing success, resulting both in the rebound 
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the research and management of the global 
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difficult field conditions in a remote corner of 
the globe and through analysis of 37 years of 
data from the circumpolar Arctic collected by 
hundreds of technicians and biologists, often 
at risk to their own lives, a new and distinc- 
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hundred years ago. The southeast 
Greenland population’s functional 
isolation is further supported by the 
following: an expansive analysis of 
movement data from bears wear- 
ing GPS collars; the demography of 
observed, biopsied, and captured 
bears; sea ice dynamics observed 
from satellite images; and hunting 
patterns of local Inuit. Through so- 
phisticated habitat analysis, natural 
history observation, and embrace 
of traditional ecological knowledge, 
the authors recognized a previ- 
ously unreported polar bear behav- 
ior—namely, the use of fresh ice at 
the marine terminal glacial fronts, 
also known as glacial mélange, as 
a platform to hunt seals. This gla- 
cial mélange enables the bears to 
succeed in a region where current 
sea ice conditions would not allow 
the bears to persist otherwise. This 
demonstrates that the habitat of 
southeast Greenland bears is already 
a climate refugium, suggesting that 
this behavioral attribute has served as a buf- 
fer against local extinction. If the prolonged 
persistence of polar bears in high Arctic sea 
ice refugia is possible, it would likely require 
similar behavioral plasticity. 

Polar bears have been observed to display 
varied behaviors elsewhere, for example, in 
the southern Beaufort population in Alaska, 
where the bears are shifting from sea ice to 
land maternity dens (5), and in the Barents 
Sea, east of the Norwegian archipelago 
Svalbard, where bears have developed sepa- 
rate sea ice and coastal life histories (6). 
Indeed, whereas the ringed seal (Pusa his- 
pida) is the most common prey of polar bears, 
diverse diets do exist, and populations with 
more diverse diets are expected to be less vul- 
nerable to climate change (7). Polar bears of 
the Davis Strait in Labrador were determined 
to be less vulnerable with their more diverse 
diet (7), and indeed their survival rates have 
been tied to a less common prey—the harp 
seal (Pagophilus groenlandicus) (3). 

Yet the polar bear’s long generation time 
will challenge its ability to adapt under nat- 
ural selection at a rate as fast as the climate 
is changing. In other words, as sea ice (and 
glacial mélange) disappears, so will the po- 
lar bear (1). Animals with a long generation 
time have evolved low variance in survival 
and reproduction because the population 
growth rate is most sensitive to these vi- 
tal rates. In polar bears, the most influen- 
tial parameter is the survival rate of adult 
females (8). These vital rates are under 
strong natural selection and provide a buf- 
fer against typical environmental variation. 

Although Laidre et al. found that south- 
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Laidre et al. have identified a distinctive and isolated population of polar 
bears in southeast Greenland. 


east Greenland’s bears have adult female 
body mass, litter sizes, and cub survival 
rates comparable to other populations, the 
bears have substantially fewer adult females 
with newborn cubs in the spring. The au- 
thors suggest that this may be due to mate- 
finding difficulties in the small population 
(estimated to be in the hundreds). However, 
the density of bears in this relatively small 
area (27,749 km’; ~8 bears per 1000 km?) is 
on par with other polar bear densities (9) 
and unlikely to be low enough now to affect 
their ability to find mates (10). 

Alternatively, the fewer females with new- 
borns in southeast Greenland could be at- 
tributed to prudent parenting (17)—deferred 
breeding to avoid trading on survival—pos- 
sibly through resorption of a pregnancy in 
the fall based on body condition, which is 
a known phenomenon in polar bears (12). 
An examination of long-lived Antarctic 
mammals and birds found much support 
for this demographic-buffering hypothesis 
during rapid sea ice decline, except for the 
Antarctic fur seal (Arctocephalus gazella), 
which showed both decreasing adult female 
survival and propensity to breed. But could 
such demographic buffering of adult survival 
in southeast Greenland mitigate the effects of 
rapid habitat change on population growth, 
or have polar bears’ vital rates evolved to be 
too constrained? More work is necessary to 
estimate the survival of adult bears in south- 
east Greenland to answer this question. 

The question of plasticity versus adapta- 
tion in the polar bear’s changing environ- 
ment is just one question in polar bear ecol- 
ogy that has been generated by Laidre et al’s 


expansive research on the southeast 
Greenland population. Will the 
population continue to persist in a 
refugium without sea ice habitat, 
and does southeast Greenland rep- 
resent a fifth distinct ecoregion of 
polar bears (13)? Further, ecologists 
could study southeast Greenland as 
a model refugium for areas of the 
high Arctic that have been proposed 
as climate refugia. 

Notwithstanding interesting fu- 
ture research, the conservation of 
southeast Greenland’s bears is im- 
perative for their intrinsic value, 
genetic diversity, and distinctive 
ecology. In 1993, the task for col- 
laboration among the world’s polar 
bear scientists and managers was 
refocused with the warning of po- 
tential impacts of climate warming 
on polar bears (14), many of which 
have unfortunately been  subse- 
quently demonstrated [e.g., (75)]. It 
is unclear whether the Agreement, 
which had been so successful in 
bringing back the global populations from 
overharvest, will be an adequate mechanism 
to protect polar bears in the face of reduced 
habitat due to climate change. The popula- 
tion of southeast Greenland is a small, geneti- 
cally distinct group of bears with a distinctive 
ecology. In the conservation of this polar bear 
population, we find an explicit test of the 
modern influence of the half-century-old in- 
ternational Agreement on the Conservation 
of Polar Bears. & 
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Chip-scale power booster for light 


A miniature amplifier with high power is realized with an ion-doped waveguide 


By Jungwon Kim 


elecommunications, the internet, and 

various streaming services are all 

made possible by ultrabroadband opti- 

cal fiber networks that span more than 

a million kilometers and connect the 

globe. Erbium-doped fiber amplifiers 
(EDFAs) are one of the most important tech- 
nologies for global data transfer (J). Mears et 
al. demonstrated EDFAs in 1987 (2), which 
not only unlocked the full potential of ultra- 
high data-rate optical transfer over a long 
fiber, but also advanced high-performance 
laser sources, such as mode-locked erbium- 
doped fiber lasers and frequency combs (3). 
On page 1309 of this issue, Liu et al. (4) report 
the realization of an ultracompact chip-scale 
version of the EDFAs, with a performance 
already comparable to that of commercially 
available EDFA modules. 

Given the enormous success of ion-doped 
fiber amplifiers, a reasonable next step is to 
use the same ion doping in a smaller inte- 
grated waveguide device for chip-scale ampli- 
fication. Since the 1990s (5), many attempts 
were made to realize chip-scale erbium- 
doped waveguide amplifiers (6). However, it 
turned out to be quite challenging to make a 
waveguide amplifier that competes with the 
well-established EDFAs. The most problem- 
atic of several interconnected issues was the 
substantial waveguide loss, which limited the 
length of a usable ion-doped waveguide and 
therefore the achievable output power level. 
Whereas the waveguide amplifiers had a de- 
cent net gain for small input signal power, 
the output power has mostly been less than a 
milliwatt. As a result, a high-power and low- 
noise on-chip amplifier has been a missing 
element for photonic integrated circuits. 

Liu et al. have solved this long-standing 
problem and report a record-high output- 
power erbium-doped waveguide amplifier 
(see the photo). The maximum on-chip out- 
put power of 145 mW is obtained when 2.61 
mW of input power is applied under 245 mW 
of pump power. This high output power, as 
well as a high power conversion efficiency of 
~60%, could be achieved by a tiny photonic 
chip with a 21-cm-long erbium-doped silicon 
nitride waveguide. The authors were able to 
pack a spiral of erbium-doped silicon nitride 
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waveguide up to 50 cm long in a compact 
footprint of only 1.2 mm by 3.6 mm. 

Crucial to this breakthrough was the ability 
to make an ultralow-loss silicon nitride wave- 
guide. By exploiting their earlier work (7), Liu 
et al. show only a 5 dB/m loss in their silicon 
nitride waveguide. After the ion implanta- 
tion and subsequent processes, this ultralow 
waveguide loss was maintained, allowing the 
amplification process to achieve a more than 
30-dB on-chip small-signal net gain. 

Liu et al. further show two applications 
of their on-chip erbium-doped waveguide 
amplifier by amplifying the output of their 
microcombs (8). Microcombs can generate a 
soliton pulse train with a repetition rate of 
tens to hundreds of gigahertz, but their out- 
put power is limited to only tens to hundreds 
of microwatts, necessitating amplification in 
almost every application. The authors’ first 


Shown is an erbium-doped waveguide amplifier on a 
photonic integrated chip of 1-cm-by-1-cm size, with 
green emission from optically excited erbium ions. 


demonstration was low-noise microwave 
generation, in which a higher optical power 
is required to scale the noise floor in the op- 
tical-to-electronic conversion process at the 
photodiode (9). The authors could scale the 
phase noise floor of the generated microwave 
by 40 dB using the on-chip amplifier, sug- 
gesting that complete on-chip photonic mi- 
crowave generation might be one step closer. 
The second demonstration was the direct 
amplification of more than 20 wavelength-di- 
vision multiplexing (WDM) channels for data 
transmission over a 1-km-long fiber link, indi- 
cating the viability of compact chip-scale am- 
plification for telecommunication networks. 
One intriguing potential for this on-chip 
waveguide amplifier is an on-chip femto- 
second mode-locked laser. Although mi- 
crocombs can generate optical pulses with 
a repetition rate of tens of gigahertz and 


higher, scaling their repetition rate below 
10 GHz is difficult. For generating ultralow- 
noise optical pulses with repetition rates 
ranging from a few megahertz to a few giga- 
hertz, mode-locked fiber lasers (J0) with 
a footprint of tens of centimeters on each 
side have been extensively developed to 
the commercial level and widely used. The 
erbium-doped waveguide amplifier might 
allow the implementation of such ultralow- 
noise mode-locked lasers on a chip with a 
footprint of less than a square centimeter. 
One obstacle that remains is managing the 
strong nonlinearity in a silicon nitride wave- 
guide. Finding a way to actively tame and 
utilize the nonlinearity using a combination 
of waveguide dispersion engineering and fil- 
tering might lead to different and interesting 
on-chip mode-locked lasers. 

The on-chip amplifier does require an off- 
chip pump diode and a pump-coupling fiber 
coupler (WDM coupler). This means that the 
entire amplifier unit is not completely on a 
chip yet. However, given the rapid advances 
in heterogeneous integration capabilities (11, 
12), a full on-chip amplifier might also be 
possible. Because the main techniques of ion 
doping and ultralow-loss silicon nitride wave- 
guide fabrication are mature, large-volume 
manufacturing of such devices should be 
possible. Finally, the doping strategy can be 
extended to a variety of different forms. For 
example, codoping with other rare-earth ions 
or heterogeneous integration with thin-film 
lithium niobate will result in amplification 
of different wavelength ranges or improved 
compatibility for microwave photonic system 
on a chip, respectively. 
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Modifying insulin to improve performance 


A platform to enable precise modifications of insulin could improve drug functionality 


By Nai-Pin Lin and Danny Hung-Chieh Chou 


he emergence of protein therapeu- 

tic drugs presents an opportunity 

for site-specific chemical modifica- 

tions to improve their functionality. 

For example, insulin derivatives can 

provide pharmacokinetic _ benefits 
over native insulin for people with type 1 
diabetes (7). Most new insulin analogs are 
produced by total chemical synthesis, and 
the long ad hoc approaches to add modifi- 
cations impede scaling up production for 
future pharmaceutical applications (2). A 
systematic functionalization approach will 
facilitate access to new insulin derivatives 
with high efficiency and scalability. On page 
1321 of this issue, Fryszkowska et al. (3) 
demonstrate that the three amino groups 
in insulin can be selectively modified to es- 
tablish a complete combination of insulin 
analogs, providing a platform for function- 


alization that could be applied to other pro- 
tein therapeutics. 

Insulin is an anabolic peptide hormone 
that promotes glucose uptake, glycogen- 
esis, lipogenesis, and protein synthesis. It 
is injected subcutaneously to treat diabetes. 
Pharmacokinetically, it has a short half- 
life, which can be extended through insulin 
modifications. For example, insulin can be 
engineered to have a longer serum half- 
life through increased binding to serum 
albumin. There are three amines—reactive 
groups used for modifications—in insulin, 
including two amino termini of the A and 
B chains and a Lys” on the B chain. To se- 
lectively modify amino groups, nearly all re- 
ported methods can only target one or a few 
amines among all available amino groups, 
with limited scalability (4, 5). US Food and 
Drug Administration (FDA)-approved insu- 
lin drugs, such as insulin detemir and in- 
sulin degludec, have aliphatic modifications 


A toolkit for insulin modifications 


Multiple penicillin G acylase (PGA) variants were generated by Fryszkowska et al. to specifically modify each 
of the three NH, groups in insulin. This enables the addition of new properties to insulin, including partial insulin 
receptor activation, glucose responsiveness, glucose-sensitive mannose receptor binding (to regulate insulin 


availability), and extended serum half-life. 
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at the Ne amino group of the only lysine 
residue, which increase their affinity with 
serum albumin. However, there are still 
no efficient methods for selective modifi- 
cations on the two amino-terminal amino 
groups on large scales (6, 7). It is even more 
difficult to achieve multiple modifications. 
Penicillin G acylase (PGA) is an enzyme 
that performs acylation and hydrolysis on 
amino groups. Fryszkowska et al. used di- 
rected evolution techniques to identify 
multiple PGA variants that can either se- 
lectively protect native insulin at the Ne 
amino group of lysine, on the amino termi- 
nus of either chain, or selectively deprotect 
the fully protected insulin on the Ne amino 
group of lysine and on the amino terminus 
of either chain. The set of PGA variants can 
generate all possible monoprotected insulin 
analogs, three diprotected insulin analogs 
on any two amino groups, and the tripro- 
tected insulin analogs through only one en- 
zymatic step, and are highly scalable. Using 
this toolbox, the synthesis of antidiabetic 
insulin candidates, MK-1092 and MK-5160 
(8), was achieved through a systematic se- 
lective protection-conjugation-deprotection 
pipeline on large scales. MK-1092 and MK- 
5160 are dimeric insulin and demonstrate 
partial agonistic properties and metabolic 
benefits in mouse models of diabetes. This 
work demonstrates the feasibility of modi- 
fying any amino group on insulin with high 
efficiency and substantially broadens access 
to insulin bioconjugates for future studies. 
This suite of PGA variants also extends 
the scope of insulin derivatives to enable 
new features beyond improved pharma- 
cokinetics. An insulin analog modified 
with both a diol and a phenylboronic acid 
group on two separate amino groups was 
recently reported (9). This dual-modified 
insulin analog can undergo conformational 
changes in the presence of fructose. This 
is an important milestone toward the de- 
velopment of glucose-responsive insulin, a 
much-anticipated therapy that could avoid 
dangerous side effects of current therapy 
such as hypoglycemia. However, its syn- 
thesis includes multiple steps of protein 
expression, proteolysis, and chemical modi- 
fications with limited efficiency. The PGA 
variants developed by Fryszkowska et al. 
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could therefore facilitate access to sophis- 
ticated insulin derivatives and speed up the 
discovery of glucose-responsive insulin by 
adding glucose-recognizing groups. 

The study of Fryszkowska et al. opens 
the door to the development of new ther- 
apeutic proteins with modifications that 
can be manufactured at large scales (see 
the figure). There remain challenges with 
the current suite of PGA variants. In the 
case of insulin, there is increasing interest 
in modifications of the carboxyl terminal 
of the A chain, which would not affect re- 
ceptor binding affinity and would tolerate 
larger modifications such as the addition of 
antibody fragments (0). A new PGA variant 
is likely required to selectively modify the 
amino group on an added lysine residue in 
this region. Furthermore, the three amino 
groups on insulin are either on or close to 
the terminal ends. It remains to be seen 
whether new PGA variants can selectively 
recognize and modify lysine residues in the 
middle of a protein, further enhancing the 
scope of applications. 

This work presents tremendous protein 
therapeutic opportunities beyond insulin. 
One intriguing target is interleukin-2 (IL- 
2), an FDA-approved drug since 1992 that 
is used to treat metastatic renal cell carci- 
noma. Incorporating unnatural amino acids 
to allow the addition of polyethylene glycol 
(PEG), engineered IL-2 with site-specific 
PEGylation enhanced antitumor properties 
in mice (17). Engineered PGA variants may 
lead to new IL-2 derivatives without the need 
to engineer proteins with unnatural amino 
acids. Glucagon-like peptide 1 (GLP-1) is a 
gut hormone involved in metabolic regula- 
tion and satiety. Recently, modified GLP-1 
drugs, such as semaglutide that has aliphatic 
modifications on lysine for half-life exten- 
sion, have shown clinical success for treat- 
ing diabetes and obesity, demonstrating how 
chemical modifications can provide better 
clinical outcomes (72). With this new plat- 
form, more peptide and protein derivatives 
are expected to be developed. 
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Cellular cross-talk in heart repair 


The cytokine meteorin-like (METRNL) improves cardiac 
function after heart attack in mice 


By Deepak Srivastava!234 


eart disease is the number one cause 
of death worldwide, and more than 
25 million survivors suffer from 
heart failure, most commonly be- 
cause of vascular occlusion that 
causes damage to the heart. The 
heart has limited capacity to regenerate af- 
ter damage, but stimulation of new blood 
vessel formation (angiogenesis) after a 
heart attack, or myocardial infarction, is 
an essential part of the repair process and 
might be a therapeutic approach to im- 
prove heart function after injury. On page 
1343 of this issue, Reboll et al. (2) show that 
upon cardiac injury in mice, monocytes 
and macrophages migrate to 
the heart, where they secrete 
the cytokine meteorin-like 
(METRNL), which stimulates 
the expansion of a subset of 
vascular endothelial cells 
expressing the KIT receptor, 
resulting in an angiogenic 
response that limits damage. 
They show that METRNL is 
a ligand for KIT, whose ac- 
tivation is necessary for the 
normal angiogenic response 
after myocardial infarction 
and could be harnessed for 
therapeutic intervention. 
Heart disease comes in 
many flavors, including car- 
diomyopathies resulting from 
intrinsic muscle defects, structural anoma- 
lies, and ischemic heart disease involving 
insufficient blood flow to the heart result- 
ing in myocardial infarction. Each of these 
can cause heart failure, where the heart’s 
pumping is insufficient to meet the body’s 
demands. The interest in stimulating car- 
diac angiogenesis in ischemic heart dis- 
ease has a long history (2). Initial efforts to 
promote angiogenesis with vascular endo- 
thelial growth factor A (VEGFA) met with 
difficulty related to the use of first-genera- 
tion viral vectors and the development of 
fragile vessels. More recently, messenger 
RNA delivery of VEGFA has been tested 
in preclinical models and is advancing in 
clinical trials (3). 
Reboll e¢ al. built on the recognition that 
cell-cell communication between myeloid 


“.Uelivering 
the meteorin- 
like (METRNL) 
protein increased 
angiogenesis 
and improved 
cardiac function 
in mice after 
acute myocardial 
infarction.” 


cells in the heart and other nonmyocytes 
may play a role in the injury response (4). 
Circulating monocytes and macrophages 
are recruited to sites of cardiac injury, 
where they secrete factors that act non- 
cell autonomously to affect cardiac remod- 
eling. The authors searched for secreted 
proteins that might mediate aspects of this 
cellular cross-talk in mouse infarcts and 
in human tissue from patients with acute 
myocardial infarction, and they found that 
METRNL expression was prominently in- 
creased. Cells expressing METRNL were 
located near blood vessels, suggesting 
that they may affect angiogenesis. Using 
mice with genetic ablation of Metrnl and 
mice receiving bone marrow cells lack- 
ing Metrnl, the authors show 
that METRNL produced by 
circulating myeloid cells that 
take up residence in the heart 
is necessary for the normal 
angiogenic response during 
cardiac repair (see the fig- 
ure). Notably, delivering the 
METRNL protein increased 
angiogenesis and improved 
cardiac function in mice after 
acute myocardial infarction. 
Although METRNL has 
previously been described as a 
secreted immunomodulatory 
cytokine (5), and one with 
cardioprotective features (6), 
its receptor had not been 
identified. Through a variety 
of approaches, Reboll et al. demonstrate 
that METRNL is a ligand for the KIT re- 
ceptor, which also responds to the secreted 
protein stem cell factor (SCF) (7). SCF func- 
tions to maintain KIT-expressing hema- 
topoietic stem cells during development 
and for maintenance in their bone mar- 
row niche. Both METRNL and SCF could 
stimulate expansion of KIT-expressing 
mouse endothelial cells and did so through 
activation of similar downstream signaling 
pathways involving AKT phosphorylation. 
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could therefore facilitate access to sophis- 
ticated insulin derivatives and speed up the 
discovery of glucose-responsive insulin by 
adding glucose-recognizing groups. 

The study of Fryszkowska et al. opens 
the door to the development of new ther- 
apeutic proteins with modifications that 
can be manufactured at large scales (see 
the figure). There remain challenges with 
the current suite of PGA variants. In the 
case of insulin, there is increasing interest 
in modifications of the carboxyl terminal 
of the A chain, which would not affect re- 
ceptor binding affinity and would tolerate 
larger modifications such as the addition of 
antibody fragments (0). A new PGA variant 
is likely required to selectively modify the 
amino group on an added lysine residue in 
this region. Furthermore, the three amino 
groups on insulin are either on or close to 
the terminal ends. It remains to be seen 
whether new PGA variants can selectively 
recognize and modify lysine residues in the 
middle of a protein, further enhancing the 
scope of applications. 

This work presents tremendous protein 
therapeutic opportunities beyond insulin. 
One intriguing target is interleukin-2 (IL- 
2), an FDA-approved drug since 1992 that 
is used to treat metastatic renal cell carci- 
noma. Incorporating unnatural amino acids 
to allow the addition of polyethylene glycol 
(PEG), engineered IL-2 with site-specific 
PEGylation enhanced antitumor properties 
in mice (17). Engineered PGA variants may 
lead to new IL-2 derivatives without the need 
to engineer proteins with unnatural amino 
acids. Glucagon-like peptide 1 (GLP-1) is a 
gut hormone involved in metabolic regula- 
tion and satiety. Recently, modified GLP-1 
drugs, such as semaglutide that has aliphatic 
modifications on lysine for half-life exten- 
sion, have shown clinical success for treat- 
ing diabetes and obesity, demonstrating how 
chemical modifications can provide better 
clinical outcomes (72). With this new plat- 
form, more peptide and protein derivatives 
are expected to be developed. 
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function after heart attack in mice 
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eart disease is the number one cause 
of death worldwide, and more than 
25 million survivors suffer from 
heart failure, most commonly be- 
cause of vascular occlusion that 
causes damage to the heart. The 
heart has limited capacity to regenerate af- 
ter damage, but stimulation of new blood 
vessel formation (angiogenesis) after a 
heart attack, or myocardial infarction, is 
an essential part of the repair process and 
might be a therapeutic approach to im- 
prove heart function after injury. On page 
1343 of this issue, Reboll et al. (2) show that 
upon cardiac injury in mice, monocytes 
and macrophages migrate to 
the heart, where they secrete 
the cytokine meteorin-like 
(METRNL), which stimulates 
the expansion of a subset of 
vascular endothelial cells 
expressing the KIT receptor, 
resulting in an angiogenic 
response that limits damage. 
They show that METRNL is 
a ligand for KIT, whose ac- 
tivation is necessary for the 
normal angiogenic response 
after myocardial infarction 
and could be harnessed for 
therapeutic intervention. 
Heart disease comes in 
many flavors, including car- 
diomyopathies resulting from 
intrinsic muscle defects, structural anoma- 
lies, and ischemic heart disease involving 
insufficient blood flow to the heart result- 
ing in myocardial infarction. Each of these 
can cause heart failure, where the heart’s 
pumping is insufficient to meet the body’s 
demands. The interest in stimulating car- 
diac angiogenesis in ischemic heart dis- 
ease has a long history (2). Initial efforts to 
promote angiogenesis with vascular endo- 
thelial growth factor A (VEGFA) met with 
difficulty related to the use of first-genera- 
tion viral vectors and the development of 
fragile vessels. More recently, messenger 
RNA delivery of VEGFA has been tested 
in preclinical models and is advancing in 
clinical trials (3). 
Reboll e¢ al. built on the recognition that 
cell-cell communication between myeloid 
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cells in the heart and other nonmyocytes 
may play a role in the injury response (4). 
Circulating monocytes and macrophages 
are recruited to sites of cardiac injury, 
where they secrete factors that act non- 
cell autonomously to affect cardiac remod- 
eling. The authors searched for secreted 
proteins that might mediate aspects of this 
cellular cross-talk in mouse infarcts and 
in human tissue from patients with acute 
myocardial infarction, and they found that 
METRNL expression was prominently in- 
creased. Cells expressing METRNL were 
located near blood vessels, suggesting 
that they may affect angiogenesis. Using 
mice with genetic ablation of Metrnl and 
mice receiving bone marrow cells lack- 
ing Metrnl, the authors show 
that METRNL produced by 
circulating myeloid cells that 
take up residence in the heart 
is necessary for the normal 
angiogenic response during 
cardiac repair (see the fig- 
ure). Notably, delivering the 
METRNL protein increased 
angiogenesis and improved 
cardiac function in mice after 
acute myocardial infarction. 
Although METRNL has 
previously been described as a 
secreted immunomodulatory 
cytokine (5), and one with 
cardioprotective features (6), 
its receptor had not been 
identified. Through a variety 
of approaches, Reboll et al. demonstrate 
that METRNL is a ligand for the KIT re- 
ceptor, which also responds to the secreted 
protein stem cell factor (SCF) (7). SCF func- 
tions to maintain KIT-expressing hema- 
topoietic stem cells during development 
and for maintenance in their bone mar- 
row niche. Both METRNL and SCF could 
stimulate expansion of KIT-expressing 
mouse endothelial cells and did so through 
activation of similar downstream signaling 
pathways involving AKT phosphorylation. 
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Notably, the phosphoproteomic signatures 
stimulated by METRNL and SCF were 
distinct from those induced by VEGFA in 
endothelial cells, suggesting that although 
METRNL and VEGFA can trigger expan- 
sion of endothelial cells, they may have 
distinct functions during angiogenesis. 
Although SCF could stimulate endothelial 
cell proliferation, it was not up-regulated 
after injury in vivo in mice or in ex vivo 
human samples and therefore did not sub- 
stitute for METRNL after infarction. Only 
a specific subset of the monocyte and mac- 
rophage population secreted high amounts 
of METRNL [marked by high expression 
of C-C chemokine receptor 2 (CCR2)], and 
only a subset of vascular endothelial cells 
were responsive and expressed the KIT re- 
ceptor. How injury is sensed and results in 


repair in the setting of chronic ischemia 
where there is not an acute injury. 

Despite these questions, it is intriguing to 
consider METRNL as a potential systemic 
biologic therapy in the setting of acute 
myocardial infarction to limit the dam- 
age caused by coronary occlusive events. 
Fortunately, in most cases, patients can be 
quickly revascularized through placement 
of coronary stents, but for those who are 
unable to seek medical care in time, pro- 
angiogenic therapies could be valuable. In 
other more-chronic cases of cardiac isch- 
emia, vascular narrowing is sometimes not 
amenable to intervention with stents, in 
which case a robust proangiogenic therapy 
could be even more important. However, 
it remains to be seen whether endothe- 


lial cells express KIT in a chronic setting, 
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METRNL activation in specific cells will be 
an important area of further study, as will 
investigations regarding the mechanisms 
underlying the expansion in KIT* endothe- 
lial cells after injury. 

Unlike past treatment with VEGFA in 
animals that involved direct intramyo- 
cardial injection, it is noteworthy that 
systemic delivery of METRNL with an 
infusion pump had sufficient effects on 
the cardiac angiogenic response in mice. 
Given the likelihood that METRNL also in- 
duces KIT* endothelial cell proliferation in 
other organs, it will be interesting to know 
whether systemic delivery induces angio- 
genesis outside the heart. Because there 
are fewer KIT* endothelial cells at baseline, 
it is possible that in the uninjured setting, 
endothelial cells are unable to respond to 
METRNL, in which case METRNL would 
be relatively inert. Although this may be 
beneficial, it may also limit utility for heart 
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A cytokine promotes angiogenesis after heart attack 
Circulating myeloid cells, including monocytes and macrophages, expressing C-C 
chemokine receptor 2 (CCR2) are recruited to the heart after acute injury. They secrete 
the cytokine meteorin-like (METRNL), which binds to KIT receptors on endothelial 

cells (ECs), resulting in EC proliferation and angiogenesis. Increased blood flow leads 
to less cardiac damage and improved heart function in mice. 
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which would be a prerequisite to METRNL 
activity. Nevertheless, the discovery of the 
myeloid-endothelial interaction and the 
central role of METRNL-KIT signaling is 
exciting and opens new avenues for both 
understanding injury responses and devel- 
oping therapies for heart disease. 
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METAMATERIALS 


Turning 
materials 
into robots 


Researchers showcase a 
3D-printed millirobot 

made out of multifunctional 
metamaterials 


By Ahmad Rafsanjani 


nstead of building a robot with separate 
parts that provide different functions, a 
robot built with multifunctional meta- 
materials has certain advantages (1). A 
metamaterial is a synthetic structure 
made of repeating patterns designed 

to exhibit desired macroscopic properties. 
Unlike bulk materials, the behavior of meta- 
materials is governed by their engineered 
architecture rather than purely driven by 
their material makeup. Additive manu- 
facturing techniques, such as 3D printing, 
have accelerated the fabrication of intri- 
cate metamaterials at increasingly smaller 
scales with unprecedented functionalities. 
Building robots has traditionally relied on 
assembling discrete actuators, sensors, mi- 
croprocessors, and power supplies. Robotic 
metamaterials challenge this paradigm 
by conceiving autonomy into the periodic 
structure of metamaterials. On page 1287 of 
this issue, Cui et al. (2) present the design 
and methods for 3D printing a mobile ro- 
botic metamaterial that can navigate auton- 
omously and interact with its surroundings. 
Historically, the research of metamateri- 
als has primarily focused on optical applica- 
tions, such as metamaterials with tunable 
optical properties beyond the capabilities of 
ordinary lenses and mirrors (3). However, 
in recent years, researchers have increas- 
ingly turned to adopting this design ratio- 
nale in other domains. Examples include 
mechanical metamaterials that can convert 
linear motion into rotation without using 
transmission gears (4) or a swarm of tessel- 
lated robots that enable on-demand adjust- 
ment of bulk properties such as stiffness or 
deformation (5, 6). Another route to creat- 
ing robotic metamaterials is the embodi- 
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Notably, the phosphoproteomic signatures 
stimulated by METRNL and SCF were 
distinct from those induced by VEGFA in 
endothelial cells, suggesting that although 
METRNL and VEGFA can trigger expan- 
sion of endothelial cells, they may have 
distinct functions during angiogenesis. 
Although SCF could stimulate endothelial 
cell proliferation, it was not up-regulated 
after injury in vivo in mice or in ex vivo 
human samples and therefore did not sub- 
stitute for METRNL after infarction. Only 
a specific subset of the monocyte and mac- 
rophage population secreted high amounts 
of METRNL [marked by high expression 
of C-C chemokine receptor 2 (CCR2)], and 
only a subset of vascular endothelial cells 
were responsive and expressed the KIT re- 
ceptor. How injury is sensed and results in 


repair in the setting of chronic ischemia 
where there is not an acute injury. 

Despite these questions, it is intriguing to 
consider METRNL as a potential systemic 
biologic therapy in the setting of acute 
myocardial infarction to limit the dam- 
age caused by coronary occlusive events. 
Fortunately, in most cases, patients can be 
quickly revascularized through placement 
of coronary stents, but for those who are 
unable to seek medical care in time, pro- 
angiogenic therapies could be valuable. In 
other more-chronic cases of cardiac isch- 
emia, vascular narrowing is sometimes not 
amenable to intervention with stents, in 
which case a robust proangiogenic therapy 
could be even more important. However, 
it remains to be seen whether endothe- 


lial cells express KIT in a chronic setting, 
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METRNL activation in specific cells will be 
an important area of further study, as will 
investigations regarding the mechanisms 
underlying the expansion in KIT* endothe- 
lial cells after injury. 

Unlike past treatment with VEGFA in 
animals that involved direct intramyo- 
cardial injection, it is noteworthy that 
systemic delivery of METRNL with an 
infusion pump had sufficient effects on 
the cardiac angiogenic response in mice. 
Given the likelihood that METRNL also in- 
duces KIT* endothelial cell proliferation in 
other organs, it will be interesting to know 
whether systemic delivery induces angio- 
genesis outside the heart. Because there 
are fewer KIT* endothelial cells at baseline, 
it is possible that in the uninjured setting, 
endothelial cells are unable to respond to 
METRNL, in which case METRNL would 
be relatively inert. Although this may be 
beneficial, it may also limit utility for heart 
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A cytokine promotes angiogenesis after heart attack 
Circulating myeloid cells, including monocytes and macrophages, expressing C-C 
chemokine receptor 2 (CCR2) are recruited to the heart after acute injury. They secrete 
the cytokine meteorin-like (METRNL), which binds to KIT receptors on endothelial 

cells (ECs), resulting in EC proliferation and angiogenesis. Increased blood flow leads 
to less cardiac damage and improved heart function in mice. 
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which would be a prerequisite to METRNL 
activity. Nevertheless, the discovery of the 
myeloid-endothelial interaction and the 
central role of METRNL-KIT signaling is 
exciting and opens new avenues for both 
understanding injury responses and devel- 
oping therapies for heart disease. 
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metamaterial is a synthetic structure 
made of repeating patterns designed 

to exhibit desired macroscopic properties. 
Unlike bulk materials, the behavior of meta- 
materials is governed by their engineered 
architecture rather than purely driven by 
their material makeup. Additive manu- 
facturing techniques, such as 3D printing, 
have accelerated the fabrication of intri- 
cate metamaterials at increasingly smaller 
scales with unprecedented functionalities. 
Building robots has traditionally relied on 
assembling discrete actuators, sensors, mi- 
croprocessors, and power supplies. Robotic 
metamaterials challenge this paradigm 
by conceiving autonomy into the periodic 
structure of metamaterials. On page 1287 of 
this issue, Cui et al. (2) present the design 
and methods for 3D printing a mobile ro- 
botic metamaterial that can navigate auton- 
omously and interact with its surroundings. 
Historically, the research of metamateri- 
als has primarily focused on optical applica- 
tions, such as metamaterials with tunable 
optical properties beyond the capabilities of 
ordinary lenses and mirrors (3). However, 
in recent years, researchers have increas- 
ingly turned to adopting this design ratio- 
nale in other domains. Examples include 
mechanical metamaterials that can convert 
linear motion into rotation without using 
transmission gears (4) or a swarm of tessel- 
lated robots that enable on-demand adjust- 
ment of bulk properties such as stiffness or 
deformation (5, 6). Another route to creat- 
ing robotic metamaterials is the embodi- 
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ment of “robotic tasks” in the structure. 
For example, one can design a metamate- 
rial whose deformation can be controlled 
with electrical signals, which can be used to 
construct prosthetics with less mechanical 
parts prone to failure. Piezoelectric materi- 
als, such as lead-zirconate-titanate (PZT), 
can generate electricity under mechanical 
stress and deform when connected to an 
electric field. However, high stiffness, heavy 
weight, and small electric field-induced 
strain impede the immediate application of 
materials such as PZT as a robotic limb. 
Cui et al. developed an approach for a 
rational design of piezoelectric metamateri- 
als made of passive, piezo-active, and con- 
ductive phases that can perform a handful 
of robotic tasks (see the photo). A desired 
deformation mode, such as a twist, was ap- 
proximated with the motion of some dis- 
crete planes passing through the structure. 
In turn, the allowed motion of these planes 
would inform how the structural phases, 
actuators, and electrodes should be orga- 
nized within the piezoelectric metamaterial 
to generate the targeted movement. In the 
classical theory of elasticity, the deforma- 
tion of an elastic material is characterized 
by extension, compression, and shearing. 
Cui et al. applied the micropolar elasticity 
that extends classical elasticity by incorpo- 
rating rotations on top of translations. This 
approach enables a more comprehensive 
evaluation of macroscopic expansion, shear, 


twist, and flexure of piezoelectric metama- 
terials to their microarchitecture, polariza- 
tion, and the applied electric field. 

To put their design strategy to the test, 
Cui et al. used a multimaterial stereolithog- 
raphy system with spatially programmed 
electrostatic charges and fabricated 3D 
ceramic lattices decorated with conduc- 
tive metals and piezoelectric properties in 
selected regions. The resulting structure is 
a robotic metamaterial that uses two-way 
conversion of an electric field to a mechani- 
cal strain for generating movement and 
sensing. The inverse piezoelectric effect 
gives the robot actuation capability, whereas 
direct and bidirectional piezoelectric effects 
enable feedback control through proprio- 
ception (self-monitoring) and exteroception 
(contact detection and remote sensing), 
respectively. The topology of metamateri- 
als allowed placing electrodes directly onto 
piezo-active struts that induced a stronger 
electric field and amplified the actuation 
strain. A proof-of-concept steerable multi- 
gait millirobot fabricated with the piezo- 
electric metamaterials showed the ability to 
move forward, turn, jump, carry cargo, and 
avoid obstacles. 

Cui et al. artfully interweaved actuation 
and perception in a lightweight miniature 
composite 3D lattice that moves around and 
senses its surroundings. The authors then 
took steps toward an untethered imple- 
mentation by devising an onboard control 


Materializing robots by robotizing matter 
The 3D-printable multifunctional metamaterial designed by Cui et al. can be used to construct simple 
and autonomous millirobots without the need for complicated assembly of multiple moving parts. 
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Conductive materials 
act as electrodes to activate 
the robotic metamaterial. 
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Inactive (unpolarized) piezoceramics 
provide a structural scaffold for the 
metamaterial, similar to what bones do for 
muscles and nerves. 


system and power supply. Such system- 
level integration, although rare, should be 
strongly encouraged to tap into the full 
potential of fast-developing robotic mate- 
rials in real-world scenarios and identify 
their shortcomings. Considering the mo- 
bile piezoelectric metamaterials discussed 
here, the wiring of piezo-active elements 
is still a limiting factor to enhancing their 
versatility, and distributing power and de- 
centralizing control are remaining barriers 
to overcome. Despite these limitations, the 
authors demonstrated that when mobility 
and untethered autonomy are not essential, 
a single piece of piezoelectric architecture 
can serve as a compact 3D printed manipu- 
lator with six degrees of freedom—that is, 
with the ability to extend translationally 
along all three axes and twist rotationally 
around all three axes. 

The study of Cui et al. highlights a 
broader view of “robotic materials” in 
which the boundary between materials and 
machines becomes indiscernible and em- 
bodied intelligence evolves into a sought- 
after material property (7). Additive manu- 
facturing of piezoelectric metamaterials 
may lead to materializing fully integrated 
robots that might eventually walk straight 
out of a 3D printer (8). Despite being made 
of hard ceramics, harnessing the internal 
architecture of metamaterials for feedback 
control of deformation can inform the de- 
sign of continuum and soft robots (9). The 
rational design approach for piezoelectric 
metamaterials can motivate the develop- 
ment of soft robotic metamaterials through 
the use of much softer electroactive poly- 
mers or magnetorheological elastomers. 
Although integrating embedded electronics 
into metamaterials remains a burden for 
some time, recent progress in distributed 
energy systems (10) can guide generating, 
storing, and harvesting power within ro- 
botic metamaterials. Cui e¢ al. bring robotic 
metamaterials closer to biological systems, 
one function at a time. 
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Get law enforcement out of 
biospecimen authentication 


Use of DNA markers in research that are used in law 
enforcement risks undermining public trust and participation 


By Debra J. H. Mathews?? and Natalie Ram? 


n response to long-standing challenges 

in biomedical research arising from 

misidentification of cell lines, the re- 

search community has broadly adopted 

a human tissue authentication stan- 

dard based on a well-characterized set 
of short tandem repeats (STRs). But in so 
doing, the research community has largely 
settled on the same genetic markers for au- 
thentication purposes that are used by law 
enforcement around the globe for forensic 
DNA analysis. This serious misstep could 
undermine public trust in science and may 
inhibit the recruitment of research partici- 
pants, particularly those from historically 
marginalized groups—many of which are 
disproportionately represented in the crim- 
inal legal system and law enforcement da- 
tabases. Authentication of research tissues 
is critically important, but using preexisting 
law enforcement marker kits may have seri- 
ous and undesirable effects that could have 
been, and could still be, avoided by develop- 
ing an independent marker set or technol- 
ogy for scientific research purposes, distinct 
and apart from the forensic set. 

The problem of cell culture misidentifi- 
cation has been recognized since the 1950s 
and was brought into sharp relief in the 
1960s and 1970s with the repeated discov- 
ery and demonstration that assumed inde- 
pendent cell lines were in fact all HeLa (J). 
The proposed solution to this substantial 
and urgent problem was cell line authenti- 
cation: standardized genetic typing to en- 
able verification that the biospecimen in 
hand is from the correct individual, and to 
ensure its identity and integrity over time, 
throughout use, development of derivatives 
(such as cell lines, derived tissues, and or- 
ganoids), and distribution to collaborators 
and the broader scientific community. 
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In response to this concern, international 
standards for cell line authentication have 
been developed, and many journals require 
authentication before publication. A 2007 
notice from the US National Institutes of 
Health (NIH) about the importance of cell 
line authentication was motivated by legiti- 
mate and long-standing concerns about the 
relationship between cell line misidentifica- 
tion and the reproducibility of research re- 
sults, noting, “Improper characterization [of 
cultured cells] can impede efforts to repli- 
cate published findings and therefore the ad- 
vancement of science” (2). In 2015, the NIH 
began requiring “authentication of key bio- 
logical and/or chemical resources,” including 
cell lines and other human tissues (3). 

Although the NIH does not require a 
particular approach to authentication of re- 
search tissues, since 2011 and updated most 
recently in 2021, the American National 
Standards Institute (ANSI) has published 
authentication standards (4), focused on 
the typing of STRs. STRs are regions of 
short, repeated DNA sequences, the length 
of which are highly variable between indi- 
viduals. By typing multiple STRs, research- 
ers can easily differentiate one individual’s 
biospecimen from another’s. In addition to 
the efficiency and effectiveness of STR pro- 
filing for research authentication, this tech- 
nology is also the most convenient and least 
expensive because of the widespread avail- 
ability of extensively validated commercial 
STR profiling kits. 

This strong commercial market for STR 
kits existed long before the authentication 
standards for research purposes. It was 
born from and tailored to law enforce- 
ment’s interest in matching crime scene 
DNA samples to DNA samples from ar- 
restees and convicted offenders. The US 
government operates the Combined DNA 
Index System (CODIS), a database devel- 
oped and maintained by the Federal Bureau 
of Investigation (FBI), through which local, 
state, and federal law enforcement entities 
can already compare crime scene DNA to 
more than 19 million known DNA profiles 


on a routine basis (5). These DNA profiles 
identify individuals arrested for or con- 
victed of specified offenses. Given the exist- 
ing standards and the products available to 
meet those standards, commercial authen- 
tication kits now sold for medical and sci- 
entific research across the market are based 
on the markers used for CODIS. 

That the research community has largely 
settled on the same genetic markers for au- 
thentication purposes that law enforcement 
uses for CODIS and other global law enforce- 
ment databases is, in some ways, unsurpris- 
ing. Using CODIS markers enabled research- 
ers to hit the ground running, without delays 
and financial investment associated with 
identifying additional, distinct markers or 
technologies for authentication purposes. 

But as a result, NIH’s authentication 
guidelines will lead to increasing numbers 
of researchers developing and reporting 
information about research subjects that 
corresponds precisely to the data that law 
enforcement typically uses to develop an 
investigative lead. The same is true for 
researchers across Europe and in Japan, 
where similar overlap exists between CODIS 
markers, local law enforcement markers, 
and those currently used for biospecimen 
authentication in research (6). Although 
there is variation in which and how many 
total markers are used by law enforcement 
in each country, a core set of seven mark- 
ers (plus Amelogenin) is shared across the 
United States, the European Union, the 
United Kingdom, and Interpol, which in- 
cludes 84 member countries from around 
the world. 


UNNECESSARY RISKS 

Data collection for biomedical research and 
for law enforcement are typically based on 
very different justifications and have very 
different goals and systems of governance. 
Protective silos have been built and bolstered 
to keep data generated by and for these par- 
ticular kinds of activities quite separate. 
However, with this overlap of authentication 
efforts and CODIS, the boundary between 
biomedical research and law enforcement 
appears to be weakening. Of note, this 
weakening is asymmetric because histori- 
cally, the FBI has largely been unwilling to 
permit researchers to access CODIS profiles 
to conduct certain kinds of research that 
could enhance, for example, understanding 
of match frequencies or the likelihood of 
false matches, which would presumably help 
further the stated goals of law enforcement 
rather than frustrate or harm them. 

We identify three clear risks with persist- 
ing in the use of CODIS markers for research 
authentication, then offer a path forward for 
authentication of research tissues that di- 
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verges from the established and likely stable 
path of law enforcement, maximizes benefit, 
and minimizes potential harm. 


Undermines efforts to recruit 

research participants from historically 
marginalized and excluded groups 

that are underrepresented in research 
Genetic research and resources are com- 
posed largely of individuals of European 
descent (7), leaving the vast genetic varia- 
tion present in non-European populations 
understudied, poorly understood, and un- 
available for the development of targeted 
genetic therapies. It is increasingly recog- 
nized that more must be done to diversify 
all aspects of genetic research, including 
the individuals who are recruited and who 
ultimately enroll as research participants. 
Qualitative research suggests that past ex- 
ploitation—including in the infa- 
mous Tuskegee syphilis study and 
of individuals such as Henrietta 
Lacks and her family, as well as 
current structural racism within 
research and health systems affect- 
ing communities of color—under- 
mine the willingness of individuals 
from historically marginalized 
communities to participate in re- 
search (8). Concomitantly, racial 
disparities have been repeatedly 
demonstrated across multiple 
components of the criminal legal 
system, including in policing, ar- 
rests, charging, and sentencing 
(9)—and subsequently in CODIS. 
CODIS aggregates profiles for 
nearly all convicted (and many 
arrested) persons throughout the 
United States, and so disparities in 
the earlier stages of the criminal legal sys- 
tem are replicated in CODIS. 

When the language used to describe peo- 
ple and samples in research is the language 
of law enforcement, that is likely to affect 
how potential research participants frame 
and understand the research enterprise 
and the risks it may entail. For populations 
disproportionately targeted by law enforce- 
ment (and thus disproportionately repre- 
sented in law enforcement DNA databases, 
among other harms), the effect is likely to 
be negative. 

That runs counter to the goals of authen- 
tication itself and the interests of science. 
Authentication is intended to ensure that 
research can be replicated and reproduced, 
which also helps ensure that participants’ 
tissue and data are used in scientifically 
rigorous research. Thus, authentication can 
be a tool for building trust with research 
participants. But using a law enforcement 
tool to further biomedical research goals 
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is likely to have the opposite result, raising 
questions about the trustworthiness of sci- 
ence by creating an indelible link between 
research and law enforcement data. Even 
if law enforcement never exploits that link, 
the fact of its existence may well erode trust 
and undermine efforts to recruit research 
participants on a broader scale. In other 
words, language matters. 


Risks drawing increased law 

enforcement interest in gaining 

access to these research data 

Law enforcement entities have repeatedly 
demonstrated an interest in and willing- 
ness to exploit genetic data for investigative 
purposes, with little regard for the origin of 
that data or the expectations of its progeni- 
tors. Consider two recent examples. The 
advent of investigative genetic genealogy by 


Authentication of human biospecimens is critically important, 
but some approaches may cause more harm than good. 


using consumer genetic data has whetted 
law enforcement appetite to use non-CODIS 
DNA repositories to generate genetic leads 
(10). Using consumer genetics platforms 
such as GEDmatch and FamilyTreeDNA, 
law enforcement and private companies 
have been able to compare crime scene 
DNA to consumer DNA profiles, investi- 
gate resulting familial connections, and—in 
more than 200 cases so far—make arrests. 
Meanwhile, investigators in San Francisco 
recently made headlines for using genetic 
data collected from sexual assault victims to 
identify at least one such victim as a suspect 
in an unrelated crime (JJ). 

Such examples conflict with a principle, 
enshrined in federal law, that special stan- 
dards of confidentiality ought to apply to 
sensitive, identifiable research data. Under 
federal law, the NIH issues Certificates of 
Confidentiality to federally funded and 
non-federally funded researchers alike. The 
relevant statute provides that researchers 


with a Certificate “shall not, in any Federal, 
State, or local civil, criminal, administra- 
tive, legislative, or other proceeding, dis- 
close or provide” identifiable, sensitive 
data they hold about research subjects (72). 
Certificates were enacted precisely to shield 
sensitive research data from law enforce- 
ment use. Yet not every researcher will have 
a Certificate, know its terms, or enforce 
it as required, and the enforceability of a 
Certificate in the face of a law enforcement 
demand remains uncertain (73). Moreover, 
it is unclear whether biobanks are appro- 
priate recipients of a Certificate at all, even 
though they are repositories of vast quanti- 
ties of sensitive, identifiable data. 

The tension—between law enforcement 
appetite to access additional sources of ge- 
netic data for investigative purposes and the 
legal and ethical commitment that research 
data demand special protections— 
indicates a need to erect practical (as 
well as legal) barriers to law enforce- 
ment use of genetic data where that 
use would be inappropriate, such as 
in the context of research. 

The use of CODIS markers for 
research authentication may ac- 
complish the opposite. Requiring 
researchers to generate data that are 
directly relevant to and usable by law 
enforcement may well invite law en- 
forcement interest in this source of 
genetic data. Indeed, some cell banks 
post CODIS marker data alongside 
the cell lines in their publicly avail- 
able catalogs. Guarding patients’ and 
research subjects’ identifying infor- 
mation so as to reduce reidentifica- 
tion risks is unlikely to be sufficient 
protection in this circumstance. 
Researchers have repeatedly demonstrated 
that reidentification of genetic data is pos- 
sible (/4). Further, were law enforcement to 
document a close or exact match between a 
research authentication profile and a crime 
scene DNA profile, a court might well order 
disclosure of identifying information associ- 
ated with that biological material. 

To be sure, any authentication technology 
or marker set will produce data that may 
attract law enforcement interest. But that 
risk is exacerbated when the data gener- 
ated are in a form that law enforcement is 
likely to deem relevant and straightforward 
to use, provided that reidentification can be 
ordered or accomplished. 


Imposes additional potential harms 

on already vulnerable populations, 
particularly children 

Among other things, public biobanks (such 
as newborn screening programs), aca- 
demic biobanks, and commercial biobanks 
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often include samples from infants and 
children. This may include healthy infants 
and children enrolled in genetic research 
by their parents, often with an intended 
benefit to a sick sibling. Genomic sequenc- 
ing projects have also enrolled healthy in- 
fants and children. 

Genetic research has usually been clas- 
sified as no more than minimal risk, sug- 
gesting that enrollment of all children in 
such research should be permissible. But 
given rapid changes in genetic technology 
and the research, commercial/consumer, 
and law enforcement landscapes, it may 
no longer be the case that such research 
should be considered minimal risk under 
current regulations. Traditional barriers 
to the movement of data from research to 
consumer to law enforcement uses have 
already begun to break down, as the use 
of consumer genetics data to solve crimes 
demonstrates (10). The use of CODIS 
markers as a matter of course for research 
authentication heightens the risks and po- 
tentially shifts the risk-benefit calculation 
of research participation. For healthy chil- 
dren, this increased risk (in the absence of 
benefit) could make their participation in 
such research ethically problematic. This 
same logic may also apply to a “Not Human 
Subject Research” designation for second- 
ary use of unconsented clinical samples. 


INNOVATE FOR INDEPENDENCE 

Risks and difficulties surrounding research 
use of CODIS markers are not inevitable. 
Although the adoption of CODIS markers 
for research authentication purposes is the 
path of least resistance, it is not the only 
path. There is no scientific reason that 
the research community should be locked 
into law enforcement’s STR platform for 
authentication at all. Ideally, researchers 
should work toward a platform-agnostic 
method of authentication that is more 
flexible and open to innovation—and com- 
pletely independent of CODIS. 

Innovation in developing STR profiles 
is not only possible, it is practicable. Take 
two examples. When the forensic com- 
munity determined that additional STRs 
would enhance its law enforcement work, 
it set about identifying those STRs. Today, 
CODIS profiles consist of 20 STR markers 
rather than the original 13. Further, these 
are not the only reliable and effective STRs 
for verifying identity. When the Coriell 
Institute for Medical Research, which 
had long used a locally developed (non- 
CODIS) six-marker set (plus Amelogenin) 
for authentication and quality control, 
was preparing to transition to a new, more 
robust set of markers for this purpose, it 
recognized the risks of adopting the CODIS 
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markers. As such, Coriell developed and 
validated a new multiplexed STR-based as- 
say for internal use that runs on the same 
platform as the original set and does not 
include CODIS markers. In combination, 
these marker sets are able to distinguish 
all 48,000 related and unrelated samples 
in the repository used for validation, dem- 
onstrating similar discriminating power to 
that of the CODIS-associated STRs for pur- 
poses of avoiding misidentification (15). 

Given the forces aligned in favor of us- 
ing CODIS (and other global law enforce- 
ment) markers for authentication, moving 
away from these markers is daunting to 
be sure. Alternatives must be developed 
and validated. A market must be built to 
bring the per-sample cost down. Legacy 
samples would need to be retyped. And 
those in a position to establish standards 
[such as ANSI and the National Institute 
of Standards and Technology (NIST)], fund 
research (such as NIH), publish research, 
and steward biospecimen collections 
would need to commit to facilitating and 
incentivizing the shift. 

But transitioning away from CODIS 
markers would enable the research com- 
munity, in this one way, to bolster rather 
than erode trust—and it is the right thing 
to do, regardless of how frequently law en- 
forcement might pursue research authen- 
tication data for investigative purposes. 
Insofar as law enforcement actively seeks 
to exploit research data, making that use 
more time-consuming and expensive by re- 
quiring different markers to be analyzed, 
may dissuade that use from becoming rou- 
tine. Even if law enforcement never pur- 
sues this use, eliminating the direct link 
between law enforcement and research 
would avoid the harm caused by the per- 
ception that link may create. 


STRENGTHEN SILOS 
The first step that needs to be taken is for 
biospecimen and cell line repositories and 
other relevant entities to immediately re- 
move all CODIS-based authentication data 
from their websites, making the informa- 
tion available only upon request. In addi- 
tion, a working group should be established 
to include representatives from NIH, ANSI, 
NIST, and other stakeholder groups to begin 
to identify viable alternatives. These could 
include non-CODIS STRs, such as the assay 
developed by Coriell, or another technology, 
such as single-nucleotide polymorphisms. 
These initial actions will ameliorate imme- 
diate potential harm and give us a basis for 
the path forward. 

In many instances, the dissolution of 
barriers between siloed data is warranted 
and advances the interests of both sci- 


ence and patients; but the disparate jus- 
tifications for collection, regulations, and 
norms that govern samples in the crimi- 
nal legal system and in research require 
that the silos around these samples and 
the data derived from them be strength- 
ened, not dissolved. The forensic DNA in 
CODIS is legally compelled from individu- 
als who have been accused or convicted 
of crimes, and it is used without consent 
against the individual from whom it was 
taken. Paradigmatically, research samples 
(and the cell lines derived from them) are 
donated by volunteers who consent to par- 
ticipate in research because they believe 
in and endorse the goals of the research 
and because they trust the person asking 
them to participate. Authentication of bio- 
specimens is and remains crucial for sound 
scientific research. But appropriating law 
enforcement tools to serve research goals 
undermines both consent and trust. As 
we have learned repeatedly over the past 
2 years, trust in science is critical to both 
individual and public health. 
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Confronting the climate crisis 


Leading literary writers explore what it means 


to live on a changing planet 


By Joseph Swift 


n ancient times, it was common to di- 
vine the future by watching animal 
behavior. For example, an unexpected 
appearance of a bear or a serpent could 
indicate the will of the gods. 
As we slide further into a fu- 
ture of climate change, there is no 
shortage of such omens. Around 
the world, animals are acting 
unusually—ticks are migrating 
northward into Canada, swarms 
of jellyfish are invading the Sea of 
Japan, and bats are dying in the 
thousands in eastern Australia. 
We do not need superstition 
to guess what type of future these 
animals are pointing to; climate 
science has provided a clear an- 
swer for decades. But given what the evidence 
predicts, one wonders whether its magnitude 
can be explained in scientific terms alone. To 
capture the impact of climate change in all 
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The World As We Knew 
It: Dispatches From 
a Changing Climate 

Amy Brady and 
Tajja Isen, Eds. 
Catapult, 2022. 288 pp. 


its existential proportions, we might need 
methods more ancient than modern science. 
Drawing on storytelling, lived experience, and 
even a bit of superstition, The World As We 
Knew It is a collection of essays from leading 
literary writers that does just that, confront- 
ing climate change with a wider 
vocabulary than science affords. 

Each essay in the book serves 
as a type of local dispatch, with 
the authors sharing how cli- 
mate change has shaped their 
own sense of place and self. On 
the surface, what they choose to 
share can appear unrelated and 
even irrelevant—readers learn 
about nuclear contamination, 
Arizona cacti, mold _ allergies, 
dam building, grief, commune 
life, raising children, and, of 
course, COVID-19. But as a collection, these 
narratives work together to shift the audi- 
ence’s perception of the environment, un- 
settling assumptions that it is something to 
be controlled or conquered. 

For example, in her essay “What We 
Don’t Talk About When We Talk About 


The future of a defining feature of the American West, 
the saguaro cactus (Carnegiea gigantea), is uncertain. 


Antarctica,” Elizabeth Rush asks why 
Antarctica is still romanticized as a place to 
demonstrate one’s personal endurance, as 
though the continent exists only as a setting 
for a form of extreme boot camp. Similarly, 
in “Come Hell,” Lacy M. Johnson hints that 
if religion treats the biosphere as a sort of 
endowment, we can become ambivalent to 
its destruction, because, regardless, “we 
please God when we use His gifts.” 

Even as they challenge the ways we ra- 
tionalize our relationship with nature, the 
book’s authors do not attempt to promote an 
alternate philosophy. While they fan out to 
assess other ways of knowing, such as what 
we can learn from feminism or Indigenous 
peoples, their musings point only toward 
encouraging ideas, not concrete solutions. 

Instead, they tend to provide readers a 
sense of climate change as something that 
is both inescapable and unconcerned with 
human endeavors. Importantly, they lead 
their audience to this understanding with- 
out using alarmist language, and in doing 
so, the vague sense of panic their writing 
instills hits deeper, throwing into relief how 
much we would rather distract ourselves 
from the fact that we are not in control. 
After all, if we really believe there is a cli- 
mate emergency, why do we spend so much 
time thinking about cryptocurrency, how 
many airline miles we have, or a journal’s 
impact factor? 

It is tricky to articulate this sort of irony, 
and, wisely, most of the authors of this 
collection do not try to define it directly. 
Instead, they opt for gentle observation. For 
example, in “Starshift,’ Gabrielle Bellot ex- 
amines her own train of thought, showing 
how it is possible to worry both about the 
growing intensity of Caribbean hurricanes 
and whether it is time to go out and buy 
new underwear. 

Some essays miss the mark, evoking an 
overwhelming sense of dread that blunts 
their intended message. And while the con- 
tributing authors are by some measures di- 
verse, most have been plucked from a nar- 
row political stratum. This is problematic, 
as discourse on science is weakened when 
constrained to one part of the intelligentsia. 

But aside from these shortcomings, The 
World As We Knew It is resoundingly ar- 
ticulate about climate change in ways that 
dispassionate scientific inquiry cannot be. 
By taking a more off-the-cuff approach, the 
essay collection is so razor sharp, one feels 
it has a chance of reaching even the most 
hardened climate skeptic. & 
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Peter Higgs and his eponymous particle 


A new biography juxtaposes the physicist’s life story with the search for the Higgs boson 


By Simon Mitton 


rank Close’s new biography of Peter 

Higgs opens with a telling incident: 

the physicist’s disappearance on the 

day the Nobel committee announced 

that he was one of the two 2013 phys- 

ics laureates. The favorite for the 
2013 award had silently slipped away from 
his Edinburgh apartment, anxious to es- 
cape media attention. He had forewarned 
his department that on the fateful day he 
would be in the Scottish Highlands. This 
was a bluff to throw nosy newshounds off 
his scent. After a refreshing three- 
mile stroll to the Firth of Forth, he 
lunched alone at a seafood restau- 
rant, leaving his colleagues to field 
a blizzard of media calls. 

Elusive: How Peter Higgs Solved 
the Mystery of Mass is notable for 
its moderation and objectivity. Two 
narratives are niftily interwoven, 
one exploring the private world of 
the modest physicist and the sec- 
ond charting the timeline (almost 
half a century) of the public search 
for the elusive boson that would 
bear his name. 

Close sensitively illustrates how 
the circumstances of Higgs’s early 
life, which included ill health and 
frequent absences from _ school, 
left an indelible mark on his adult 
persona, characterized by shy- 
ness, extreme modesty, and a love 
of solitude. 

From his father’s cast-off en- 
gineering textbooks, the eager 
youngster discovered the world of 
mathematics. Self-taught, he mas- 
tered calculus, algebra, and trigo- 
nometry years ahead of his age group. This 
formative learning would later flourish as 
a lifelong passion for deep scholarship and 
intensive study. 

In high school, Higgs became intrigued 
by the name Paul Dirac, which featured 
prominently on the honors board. Curious 
to learn more of this alumnus, he turned to 
Max Born’s Atomic Physics, for its accessi- 
ble account of Dirac’s discoveries in quan- 
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tum mechanics. He read with amazement 
of Dirac’s prescient conjecture of the exist- 
ence of antimatter, which Carl Anderson 
confirmed from observations of positron 
tracks in a cloud chamber. Decades later, 
he pinpointed this as a turning point of his 
life in physics. 

Higgs graduated first-class from King’s 
College London in 1950 and earned his 
doctorate in 1954 for theoretical research 
on the spectroscopy of organic molecules. 
Thereafter he focused on quantum field 
theory but failed to write anything of con- 
sequence for several years. 


Higgs credited his fortuitous prediction to a flash of insight. 


The middle section of the book, with its 
vivid account of the parlous state of particle 
physics theory in the mid-1960s, gives the 
context in which Higgs found himself. Here, 
Close’s description of the mathematical ma- 
nipulations employed by Higgs and others 
will challenge readers not already at ease 
with quantum mechanics and the standard 
model of particle physics. He offers useful 
tutorials and a couple of appendices that 
may help those who have the time for a sec- 
ond or third reading. Better still, just skip 
to the epilogue for the author’s delightful 
overview in plain English. 


Elusive: How Peter 
Higgs Solved 

the Mystery of Mass 
Frank Close 

Basic Books, 2022. 
304 pp. 


Higgs and many rivals struggled to com- 
prehend the fundamental particles and 
forces of our Universe. All attempts to unify 
the electromagnetic and the weak nuclear 
forces failed. A fault lurked somewhere in the 
modeling, which predicted that the particles 
(bosons) mediating the weak interac- 
tion had zero mass. Higgs viewed the 
concept of a massless boson as ridic- 
ulous, intuiting that modelers were 
overlooking something. But what? 

After 3 years of fruitless attempts 
to unpick the mistakes others had 
made, Higgs found the error in a 
flash of insight, spotting that the 
leading contender had neglected 
the influence of electromagnetism. 
It took him a weekend to plug that 
gap by invoking a hypothetical quan- 
tum field to generate mass. Later, he 
articulated his surmise as “the only 
really original idea I’ve ever had” 

Higgs became aware of rivals 
who were postulating that an un- 
usual quantum field had endowed 
bosons with mass. Luckily, he pub- 
lished first and suggested that the 
massive boson could be identified 
from its decay products. However, 
his papers failed to gain traction, 
because the Higgs mass creation 
mechanism had the appearance of 
a mathematical fudge. 

Higgs instinctively adopted a low 
profile until 1967, when Steven Weinberg 
enthusiastically cited Higgs et al. for their 
exorcising of massless bosons. By the mid- 
1970s, the path to a mathematically tidy 
(renormalizable) standard model was clear. 
The task that then lay ahead was finding the 
Higgs boson, a totem by which the validity 
of the standard model would be judged. 
Close concludes his biography with an eye- 
witness account of the momentous day, 
4 July 2012, when CERN applauded the dis- 
covery of “a particle compatible with the 
Higgs boson.” 
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Shrinking aquifers and 
land subsidence in Iran 


More than 98% of Iran’s 1.648 million 
km2 of land faces land subsidence (J, 

2). Internationally, a rate of subsidence 
greater than 4 mm per year (3) is consid- 
ered a crisis; Iran’s land is sinking at an 
astonishing rate of 6 cm per year (J) asa 
result of 25 years of water level decline in 
the plains (2, 4). Dam construction, cli- 
mate change, inefficient water consump- 
tion by agriculture and industries, and the 
use of underground aquifers as sources 
for illegal agricultural water extraction 
wells all continue to deplete Iran’s water 
table (5, 6). Iran must address its water 
levels before subsidence leads to a human- 
itarian crisis. 

In urban areas, subsidence will damage 
buildings, bridges, transportation lines, 
and energy transmission lines as well as 
reduce the earthquake resistance of build- 
ings (1, 2). Historical monuments are also 
at risk. Moreover, continued water level 
declines will reduce the ground’s water 
permeability and turn fertile plains into 
barren deserts. 

Subsidence can be managed and 
controlled with proper monitoring. To 
address the problem, Iran should save 
water by adopting mechanized irrigation 
in agriculture and managing wells to 
prevent illegal water extraction. Farmers 
should use cultivation patterns that maxi- 
mize water efficiency in the production of 
crops. Where possible, agricultural prod- 
ucts should be cultivated in greenhouses, 
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which use less water than traditional 
farming (7). Finally, to protect the aquifers 
in the plains from further water loss, Iran 
should facilitate the transfer of indus- 
tries with high water consumption to the 
shores of the Caspian Sea or the Persian 
Gulf, where desalination technologies can 
provide adequate water (8). 
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Biotechnology ethics 
for food and agriculture 


In January, the National Bioengineered 
Food Disclosure Law (NBFDL) went into 
effect, requiring all US food manufactur- 
ers to disclose whether their products 
contain bioengineered ingredients (1). 
However, the law is the subject of debate, 


Sinking land and fissures in Iran could 
lead to a humanitarian crisis. 


with critics arguing that it inhibits rather 
than increases transparency around the 
use of biotechnology in food (2). Given 
the importance of the agbiotech industry, 
the Biden-Harris administration should 
develop an agricultural biotechnology 
advisory commission devoted to inclusive 
deliberation on ethics and governance in 
agricultural and food biotechnology. 

The NBFDL is consistent with the US 
Department of Agriculture’s (USDA’s) 
Sustainable, Ecological, Consistent, 
Uniform, Responsible, Efficient (SECURE) 
rule (3), which exempts many gene-edited 
crop traits from regulation and allows 
companies to self-regulate. Similarly, the 
NBFDL grants exemptions from labeling 
for processed foods with trace percentages 
(<5%) of genetic material and allows for 
disclosure through QR codes rather than 
through a clearly identifiable symbol that 
can be read directly on a package. Several 
lawsuits alleging deficiencies in the gov- 
ernance of genetically engineered crops 
and food animals are making their way 
through the court system (2, 4). 

The United States has appointed a bio- 
ethics commission in the past, but no com- 
mission has been formed since 2017 (5). 
The urgency of reinstating the commission 
has been highlighted by National Academy 
of Sciences President Marcia McNutt and 
National Academy of Medicine President 
Victor J. Dzau (5, 6). The need for a com- 
mission addressing ethics in public health, 
biomedicine, and climate science signals 
that we should also create bioethics com- 
missions for other areas, including agricul- 
ture biotechnology. 

Commissions focusing on agricultural 
bioethics have never existed in the United 
States, but they have been successfully 
developed elsewhere. The Norwegian 
Biotechnology Advisory Board has rec- 
ommended a forward-looking regulatory 
framework for genetically modified organ- 
ism use and gene editing in agriculture 
based on extensive public consultation (7). 
The UK’s Nuffield Council on Bioethics, an 
independent forum funded by the Medical 
Research Council and the Wellcome 
Trust, has published a series of reports 
on the social and ethical issues related to 
genome editing and farmed animal breed- 
ing, elicited public responses to proposed 
regulation changes, and facilitated public 
dialogues on genome editing in farm ani- 
mals (8). The United States should use 
these examples as models for the estab- 
lishment of a presidential bioethics com- 
mission that addresses critical issues and 
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promotes inclusive engagement around 
biotechnology in food and agriculture. 
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Broaden chemicals scope 
in biodiversity targets 


On 21 June, the next round of negotia- 
tions on the post-2020 global biodiversity 
framework will be held in Nairobi (7). In 
the draft document (2) listing 21 targets, 
target 7 addresses chemical pollution 

by explicitly mentioning nutrients, pes- 
ticides, and plastic waste. Limiting this 
target’s scope to these three groups does 
not do justice to the immense variety of 
anthropogenic chemicals polluting the 
environment (3), which also include, for 
example, toxic metals, industrial chemi- 
cals, chemicals from consumer products, 
and pharmaceuticals (4), as well as the 
(often unknown) transformation products 
of substances from each group (5). We 
urge the negotiators to broaden the scope 


1280 17 JUNE 2022 + VOL 376 ISSUE 6599 


of target 7 to reflect the complexity of 
chemical pollution. 

Both direct and indirect impacts of chem- 
ical pollutants on organisms in the environ- 
ment can lead to population instability, 
possibly resulting in the decline or even 
extinction of vulnerable species. Chemical 
pollutants can also cause undesired shifts 
in community composition and/or function, 
which can affect ecosystem services (6). 
Biological adaptation to chemical exposure 
may decrease genetic diversity, reducing 
resilience to future stressors, such as global 
warming and other aspects of global change 
(7). Thus, to focus solely on nutrients and 
pesticides would detract attention from 
myriad potential interactions, beyond eutro- 


Chemical pollution mitigation plans should address 
pharmaceutical waste. 


phication and acute toxicity, which could 
negatively affect biodiversity and ecosys- 
tems at large. 

Understanding environmental exposures 
and their consequences is a formidable task 
because of the variety of pollutants and mul- 
titude of potential impacts, as well as the 
fact that chemicals occur in mixtures and 
act in conjunction with other stressors (7, 8). 
To comprehensively address these complex 
interactions, joint efforts by interdisciplinary 
teams of researchers are essential. Neither 
the scientific community nor research 
funders have fully recognized or adequately 
responded to this necessity yet. 

Environmental pollution by anthropo- 
genic chemicals has been recognized as 
a major agent of global change (9). The 
continuous rise in the creation, production, 
and use of chemicals far outpaces human- 
ity’s capacity to assess their hazards and 


risks to human health and the environ- 
ment (JO, 17). To tackle the global threats 
of chemical pollution, countries around the 
world have recently agreed on creating an 
intergovernmental science-policy panel on 
chemicals and waste, charged with consoli- 
dating existing knowledge to inform policy- 
makers (12). The available evidence already 
justifies targeting a wider scope of chemi- 
cal pollutants for strategies and action to 
be implemented in the post-2020 global 
biodiversity framework. 
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Separating spin and charge 
n one-dimensional fermionic systems, spin and charge excitations can decouple from 
each other. This so-called spin-charge separation has been detected in solids and cold- 
atom systems held in optical lattices. Senaratne et al. observed spin-charge separation in 
one-dimensional Fermi gases of lithium atoms in the absence of a lattice structure within 
the gas. The researchers were able to excite the spin and charge excitation modes inde- 

pendently from each other and measure their velocities as a function of the strength of the 

atomic interactions. —JS Science, abn1719, this issue p. 1305 


Artist's conception of a spin excitation propagating through a one-dimensional gas of fermionic atoms 


TRANSCRIPTION 
When Pol Il meets 
nucleosome 


Eukaryotic cells organize their 
large genomes into a compacted 
structure called chromatin. 

The condensed structure of 


SCIENCE science.org 


chromatin, with its fundamental 
unit, the nucleosome, repre- 
sents a challenge to nucleic 
acid-transacting machines 
including RNA polymerase || 
(Pol II), the enzyme responsible 
for the transcription of most 
protein-coding genes. How RNA 


Pol Il overcomes nucleosomes 
without disrupting chromatin 
organization remains unknown. 
Using cryo-electron microscopy, 
Filipovski et al. provide struc- 
tural snapshots of a complex 
between mammalian RNA Pol 

ll and a nucleosome that show 
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how previously transcribed DNA 
rewraps the nucleosome. The 
finding provides a structural 
basis of how nucleosomes, and 
consequently epigenetic marks, 
are retained during transcrip- 
tion. —DJ 

Science, abo3851, this issue p. 1313 


CORONAVIRUS 


First off the COVID block 


The severe acute respira- 
tory syndrome coronavirus 2 
(SARS-CoV-2) pandemic has 
been characterized by waves of 
transmission initiated by new 
variants replacing older ones. 
Given this pattern of emer- 
gence, there is an obvious need 
for the early detection of novel 
variants to prevent excess 
deaths. Obermeyer et al. have 
developed a Bayesian model to 
compare relative transmissibil- 
ity of all viral lineages. Using 
this model, emerging lineages 
can be spotted together with 
the mutations that contribute 
toward transmissibility, not 
only in Spike, but also in other 
viral proteins. The model can 
prioritize lineages as they 
emerge for public health con- 
cern. —CA 

Science, abm1208, this issue p. 1327 


MOLECULAR BIOLOGY 
How to make 
selenoproteins 


In all domains of life, the 
essential trace element 
selenium is incorporated into 
selenoproteins as the amino 
acid selenocysteine during 
protein translation. Specialized 
protein and RNA factors assist 
selenocysteine transfer RNA 
to reinterpret specific UGA 
codons, not as a signal to end 
protein synthesis, but rather 
as a sign for selenocysteine 
insertion. Hilal et a/. used cryo— 
electron microscopy to trap 
and visualize the mammalian 
ribosome as it decodes the 
selenocysteine UGA codon. An 
unforeseen extended network 
of interactions between key 
molecular players facilitates 
the recoding event, thereby 
providing a basis for further 


1281 


RESEARCH | IN SCIENCE JOURNALS 


studies of this fundamental 
biological process. —DJ 
Science, abg3875, this issue p. 1338 


BIOLOGY 
A tractable lead for lupus 


Lack of optimal immune 
defense functions and thus 
increased risk for infections is 
a known issue for patients with 
systemic lupus erythematosus 
(SLE), adding an additional pos- 
sible complication to an already 
often-severe systemic autoim- 
mune disease. Chen et al. found 
that the cell surface enzyme 
CD38 compromises mitochon- 
drial fitness by suppressing 
mitophagy in CD8* T cells from 
lupus patients. Administration 
of a CD38 enzymatic inhibitor 
reverses cytotoxic T cell defects, 
which are thought to be a cause 
of the impaired immune func- 
tion. Because CD38 inhibitors 
are available for the clinical 
setting, this finding opens up 
interesting new avenues to 
hopefully restore immune func- 
tion in SLE patients. -MGvH 
Sci. Adv. 10.1126/ 
sciadv.abo4271 (2022). 


HEART REPAIR 
Meteorin-like signals 
through KIT 


The adult mammalian heart has 
limited regenerative capacity 
and heals by scar formation 
after myocardial infarction. 
Effective repair of the injured 
heart entails a vigorous angio- 
genic response that mitigates 
scarring and helps to preserve 
pump function. Reboll et al. 
identified the myeloid cell- 
derived cytokine meteorin-like 
(METRNL) protein as being a 
critical driver of postinfarction 
angiogenesis and a high- 
affinity ligand for the stem cell 
factor receptor KIT (see the 
Perspective by Srivastava). 
METRNL promotes infarct 
repair in mice by selectively 
expanding the KIT-expressing 
endothelial cell population in 
the infarct border zone. Failure 
to mount this KIT-dependent 
angiogenic response provokes 
severe postinfarction heart 


1282 


failure. METRNL provides a 
mechanism for activating endo- 
thelial KIT signaling after tissue 
injury. —BAP 
Science, abn3027, this issue p. 1343; 
see also adc8698, p. 1271 


CANCER IMMUNOLOGY 
Engulfing to improve 
radiation 


Despite the wide use of radia- 
tion therapy (RT) in colorectal 
cancer, many patients expe- 
rience progression at 
non-irradiated sites of disease. 
Hsieh et al. used mouse model- 
ing and human cancer cell lines 
to show that RT increased the 
expression of the cell surface 
markers CD47 and PD-Llina 
DNA repair signaling—depen- 
dent manner. Targeting these 
pathways with antibodies 
against their ligands, SIRPa 
and PD-1, in combination with 
RT, led to clearance of primary 
tumors and robust abscopal 
effects. This triple combina- 
tion depended on host STING 
expression, leading to improved 
tumor cell phagocytosis and 
subsequent cross-priming 
of tumor antigen-specific T 
cells. These data suggest that 
targeting immune checkpoints 
and phagocytosis during RT 
may help in the treatment of 
patients with colorectal cancer. 
—DAE 

Sci. Immunol. 7, eabl9330 (2022). 


HISTORICAL DROUGHT 
Arabian aridity 


Islam emerged in the Arabian 
Peninsula in the early seventh 
century CE after a period of 
profound societal changes. 
What was the environmental 
context of that emergence? 
Fleitmann et al. present 
a hydroclimate record of 
Southern Arabia showing 
that droughts plagued the 
region throughout the sixth 
century CE. They suggest that 
increasing aridity and declin- 
ing agricultural yields may 
have contributed to demise of 
Himyar, the dominant power in 
Arabia at that time. —HJS 
Science, abg4044, this issue p. 1317 
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SPECIES INTERACTIONS 
Mismatches matter 


s the climate changes, interacting species often respond 
differently to changing seasonal cues. This phenomenon 
can lead to phenological mismatches in which temporal 
decoupling between species occurs, such as predators 
emerging before or after their prey. Wilde et al. investi- 
gated how phenological mismatches caused by earlier insect 
emergence influence the growth and survival of chicks of the 
Hudsonian godwit (Limosa haemastica, an Alaskan shorebird 
shown here). The young godwits’ growth and survival were 
highest during periods of high insect abundance, but chicks 
also required larger insects as they grew. Thus, the consum- 
er’s population success depends not just on synchrony with 
its prey, but also on overlap between individuals’ changing 
resource needs and the quantity and quality of available 


resources. —BEL 
Ecology 10.1002/ecy.3743 (2022). 


IMMUNOLOGY 
A Pten-tial cause of 


autoimmunity? 

Although platelets are mostly 
known for their role in clotting, 
they also have immunoregula- 
tory functions. Hyperactive 
platelets are associated with 


several autoimmune diseases. 
Chen et al. report that mice with 
hyperactive platelets develop 
severe age-related autoimmune 
and lymphoproliferative disease. 
Platelet-specific deletion of a 
gene called phosphatase and 
tensin homolog (Pten) results in 
excessive platelet aggregation 
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CELL BIOLOGY 
Making a chromosomal 
bouquet 


During meiosis, chromosomal 
pairing requires microtubule- 
dependent telomere tethering 
and rotation on the nuclear 
envelope, a process that also 
drives chromosome homol- 
ogy searches. Formation of the 
structure called the “zygotene 
chromosomal bouquet” involves 
telomere pulling toward the cen- 
trosome. Mytlis et al. describe 
the so-called “zygotene cilium,” 
which helps to ensure chromo- 
somal bouquet formation and 
germ cell morphogenesis in 
zebrafish and mouse oocytes. 
The cilium anchors the cen- 
trosome, allowing it to resist 
telomere pulling, and is essential 
for bouquet and synaptonemal 
complex formation and success- 
ful oogenesis. —SMH and BAP 
Science, abh3104, this issue p. 1284 


INNATE IMMUNITY 
Human OTULIN 
haploinsufficiency 


The interindividual clini- 

cal variability upon infection 
with Staphylococcus aureus 

in otherwise healthy people 

is poorly understood. Using 
human genetic approaches, 
Spaan et al. found a genome- 
wide enrichment for rare 
heterozygous OTULIN vari- 
ants in a cohort of patients 
with severe staphylococcal 
disease. Autosomal-dominant 
OTULIN deficiency operates 

by haploinsufficiency and is 
phenocopied in patients with 
the more common 5p- (or 

Cri du Chat) chromosomal 
deletion syndrome. OTULIN 
haploinsufficiency underlies 
staphylococcal necrosis of the 
skin and lungs by increasing the 
susceptibility of tissue-resident, 
nonhematopoietic cells to the 
major staphylococcal virulence 
factor a-toxin. Naturally elicited 
a-toxin—neutralizing antibodies 
apparently rescue the disorder 


and contribute to incomplete 
clinical penetrance. —SMH 
Science, abm6380, this issue p.1285 


PLANT SCIENCE 
Heat tolerance in rice 


Too much heat can damage 
a plant's chloroplasts, driving 
yield down when temperatures 
exceed a crop’s normal toler- 
ance. Zhang et al. identified 
a locus with two genes that 
together enhance rice thermo- 
tolerance. The ubiquitin ligase 
activity of Thermo-tolerance 3.1 
(TT3.1) drove degradation of 
TT3.2, a chloroplast precursor 
protein that can trigger chlo- 
roplast damage in the context 
of heat stress. Together, these 
protein products respond to heat 
and control the damage. —PJH 
Science, abo5721, this issue p. 1293 


NEURODEVELOPMENT 
RNA modification hub at 
the centrosome 


The centrosome acts as the cell's 
microtubule-organizing center, 
supporting cell division and the 
extension of cilia and neurites. 
Newly born neurons require the 
microtubule-organizing activity 
of centrosomes to migrate away 
from their birthplace at the ven- 
tricle. O'Neill et al. analyzed the 
centrosome proteome of human 
induced pluripotent stem cell— 
derived neural stem cells and 
neurons. The neural centrosome 
proteome contains a variety of 
RNA-binding/modifying proteins, 
including an RNA-splicing factor 
mutation that is linked to periven- 
tricular heterotopia. —PJH 
Science, abf9088, this issue p. 1286 


ROBOTIC MATERIALS 
Form and function 
all in one 


Piezoelectric actuators are one 
route to driving motion in robotic 
systems. However, one typically 
needs either multiple crystals or 
engineered structures to allow 


1283-B 17 JUNE 2022 + VOL 376 ISSUE 6599 


motion with multiple degrees 
of freedom. Cui et al. designed 
architected materials composed 
of conductive and piezoelectric 
materials that couple electric 
field and mechanical strain (see 
the Perspective by Rafsanjani). 
The authors engineered these 
three-dimensional materi- 
als to be capable of a variety 
of motions and transducer 
functions by using additive man- 
ufacturing to build the complex 
shapes. They demonstrate their 
functionality for actuation and 
sensing in a unified miniaturized 
mobile robot that can move, 
sense, and perform feedback 
control. —MSL 

Science, abnOO90, this issue p. 1287; 

see also abq4102, p. 1272 


OIL POLLUTION 


Slick findings 
Ocean oil slicks may have natural 
or anthropogenic sources, but 
how many occur, how extensive 
they are, and in what proportions 
they happen are unclear. Dong 
et al. present a global oil slick 
map and a detailed inventory of 
static-and-persistent sources 
for the period between 2014 and 
2019 (see the Perspective by 
Leifer). They observed a highly 
uneven distribution of slicks, 
with most of them located within 
160 kilometers of coastlines 
and along shipping routes. 
Anthropogenic sources consti- 
tute the overwhelming majority 
of cases, and their numbers 
show that their contribution may 
have been significantly underes- 
timated in the past. —HJS 
Science, abm5940, this issue p. 1300; 
see also abp8666, p. 1266 


PHOTONICS 

On-chip optical 
amplification 

The success of long-haul opti- 
cal communications and our 
information society is largely due 
to the invention of the erbium- 


doped fiber amplifier. Because 
the need for faster chips is 


expected to see a shift from 
electronics- to photonics-based 
technologies, erbium ions could 
form the basis for amplification 
in optical integrated circuits. Liu 
et al. used an ultra-low-loss sili- 
con nitride photonic integrated 
circuit with a waveguide length 
up to 0.5 meters and erbium 
ion implantation to fabricate an 
erbium-doped waveguide ampli- 
fier on a compact photonic chip 
(see the Perspective by Kim). 
Operating in the continuous- 
wave regime and providing large 
optical gain in the telecommu- 
nication bands, the results are 
promising for device applica- 
tions. —ISO 

Science, abo2631, this issue p. 1309; 

see alSo abq8422, p.1269 


CONSERVATION 
Anew hope 


Polar bears are one of the most 
mentioned—and iconic—poten- 
tial victims of climate change. 
Most polar bears rely on sea 
ice to hunt, so the current and 
predicted reductions in sea ice 
occurrence and persistence 
are likely to have major impacts 
on their survival. Laidre et al. 
describe the discovery of an iso- 
lated population of polar bears 
from southeastern Greenland 
that is much less reliant on 
sea ice, instead existing at the 
terminal end of a glacier and 
relying on resources from this 
glacial-freshwater mélange (see 
the Perspective by Peacock). 
Discovery of this population 
suggests both that such environ- 
ments might serve as refugia for 
polar bears and that conserva- 
tion of this new population is 
essential. -SNV 

Science, abk2793, this issue p. 1333; 

see also abq5267, p. 1267 


BIOCONJUGATION 
Enzymes for tailored 
insulin derivatives 


The development of drugs 
based on protein hormones 
such as insulin is a challenge 
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because many of the usual tools 
in organic synthesis are either 
not compatible with folded 
proteins or lack selectivity when 
redundant functional groups 
are present. Fryszkowska et al. 
generated a panel of enzymes 
that selectively modify one or 
more of the three amine groups 
of insulin, adding or remov- 
ing protecting groups (see the 
Perspective by Lin and Chou). 
The enzymatically functionalized 
molecules can then be modi- 
fied by electrophilic reagents 
to produce homogeneous 
insulin conjugates. The authors 
also demonstrate enzymatic 
activity on a range of bioactive 
peptides, suggesting that bio- 
catalytic approaches may help 
in the development of potential 
peptide-based therapeutics or 
probes. —MAF 

Science, abn2009, this issue p. 1321; 

see also abq7217, p. 1270 


PHYSIOLOGY 
Hearing better with 
TRal signaling 


Mutations that affect the thyroid 
hormone receptor TRB cause 
resistance to thyroid hormone 
and have been associated with 
deafness. Affortit et al. dem- 
onstrated hearing loss that 
worsened with age in juvenile 
mice with a frameshift mutation 
(similar to disease-associated 
mutations in humans) in the 
gene encoding the related 
thyroid hormone receptor iso- 
form TRal. The cochlear outer 
hair cells in these mice were 
abnormally oriented or absent 
and showed mislocalization of 
a prosurvival factor for outer 
hair cells and increased signs of 
oxidative stress. -AMV 

Sci. Signal. 15, eabj4583 (2022). 


HEART MUSCLE DISEASE 
Conditional 


cardioprotection 


Phosphodiesterase 5 (PDE5) 
inhibition has been shown to have 
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inconsistent cardioprotective 
effects. Pofi et al. conducted a 
small phase 4 trial to assess sex 
differences in cardiac remodel- 
ing in patients with diabetes and 
diabetic cardiomyopathy treated 
with the PDES inhibitor tadalafil. 
They observed an improvement 
in cardiac shortening and torsion 
in men but not women, but the 
circulating small noncoding RNA 
hsa-miR-199-5p was reduced 
and the protein Klotho was 
increased with treatment in both 
sexes. Certain populations of 
monocytes were also altered with 
treatment. These results highlight 
the importance of considering 
sex-specific differences in treat- 
ment response. —CC 

Sci. Transl. Med. 14, eabl8503 (2022). 


GENETICS 
How trypanosomes 
go quiet 
The parasite Trypanosoma brucei 
causes sleeping sickness, which 
can be fatal. At a critical density, 
the parasite differentiates into 
a quiescent stumpy form. This 
transition promotes host survival 
but also facilitates trypanosome 
transmission. Guegan et al. 
conducted RNA sequencing in T. 
brucei to identify 1428 previously 
unknown long noncoding RNA 
(IncRNA) genes. They found 
that one of these genes, named 
grumpy, promotes differentia- 
tion into the quiescent form, and 
that this process is mediated by 
a small nucleolar RNA (SnoRNA) 
encoded within the IncRNA. The 
snoRNA binds to the mRNAs of 
at least two genes that regulate 
stumpy differentiation, enhancing 
their expression. Overexpression 
of grumpy reduces parasitemia 
in infected mice, indicating that 
trypanosome IncRNAs regulate 
parasite-host interactions. —-DLD 
Sci. Adv. 10.1126/ 
sciadv.abn2706 (2022). 


17 JUNE 2022 « VOL 376 ISSUE 6599 


1283-C 


RESEARCH | IN SCIENCE JOURNALS 


studies of this fundamental 
biological process. —DJ 
Science, abg3875, this issue p. 1338 


BIOLOGY 
A tractable lead for lupus 


Lack of optimal immune 
defense functions and thus 
increased risk for infections is 
a known issue for patients with 
systemic lupus erythematosus 
(SLE), adding an additional pos- 
sible complication to an already 
often-severe systemic autoim- 
mune disease. Chen et al. found 
that the cell surface enzyme 
CD38 compromises mitochon- 
drial fitness by suppressing 
mitophagy in CD8* T cells from 
lupus patients. Administration 
of a CD38 enzymatic inhibitor 
reverses cytotoxic T cell defects, 
which are thought to be a cause 
of the impaired immune func- 
tion. Because CD38 inhibitors 
are available for the clinical 
setting, this finding opens up 
interesting new avenues to 
hopefully restore immune func- 
tion in SLE patients. -MGvH 
Sci. Adv. 10.1126/ 
sciadv.abo4271 (2022). 


HEART REPAIR 
Meteorin-like signals 
through KIT 


The adult mammalian heart has 
limited regenerative capacity 
and heals by scar formation 
after myocardial infarction. 
Effective repair of the injured 
heart entails a vigorous angio- 
genic response that mitigates 
scarring and helps to preserve 
pump function. Reboll et al. 
identified the myeloid cell- 
derived cytokine meteorin-like 
(METRNL) protein as being a 
critical driver of postinfarction 
angiogenesis and a high- 
affinity ligand for the stem cell 
factor receptor KIT (see the 
Perspective by Srivastava). 
METRNL promotes infarct 
repair in mice by selectively 
expanding the KIT-expressing 
endothelial cell population in 
the infarct border zone. Failure 
to mount this KIT-dependent 
angiogenic response provokes 
severe postinfarction heart 
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failure. METRNL provides a 
mechanism for activating endo- 
thelial KIT signaling after tissue 
injury. —BAP 
Science, abn3027, this issue p. 1343; 
see also adc8698, p. 1271 


CANCER IMMUNOLOGY 
Engulfing to improve 
radiation 


Despite the wide use of radia- 
tion therapy (RT) in colorectal 
cancer, many patients expe- 
rience progression at 
non-irradiated sites of disease. 
Hsieh et al. used mouse model- 
ing and human cancer cell lines 
to show that RT increased the 
expression of the cell surface 
markers CD47 and PD-Llina 
DNA repair signaling—depen- 
dent manner. Targeting these 
pathways with antibodies 
against their ligands, SIRPa 
and PD-1, in combination with 
RT, led to clearance of primary 
tumors and robust abscopal 
effects. This triple combina- 
tion depended on host STING 
expression, leading to improved 
tumor cell phagocytosis and 
subsequent cross-priming 
of tumor antigen-specific T 
cells. These data suggest that 
targeting immune checkpoints 
and phagocytosis during RT 
may help in the treatment of 
patients with colorectal cancer. 
—DAE 

Sci. Immunol. 7, eabl9330 (2022). 


HISTORICAL DROUGHT 
Arabian aridity 


Islam emerged in the Arabian 
Peninsula in the early seventh 
century CE after a period of 
profound societal changes. 
What was the environmental 
context of that emergence? 
Fleitmann et al. present 
a hydroclimate record of 
Southern Arabia showing 
that droughts plagued the 
region throughout the sixth 
century CE. They suggest that 
increasing aridity and declin- 
ing agricultural yields may 
have contributed to demise of 
Himyar, the dominant power in 
Arabia at that time. —HJS 
Science, abg4044, this issue p. 1317 
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SPECIES INTERACTIONS 
Mismatches matter 


s the climate changes, interacting species often respond 
differently to changing seasonal cues. This phenomenon 
can lead to phenological mismatches in which temporal 
decoupling between species occurs, such as predators 
emerging before or after their prey. Wilde et al. investi- 
gated how phenological mismatches caused by earlier insect 
emergence influence the growth and survival of chicks of the 
Hudsonian godwit (Limosa haemastica, an Alaskan shorebird 
shown here). The young godwits’ growth and survival were 
highest during periods of high insect abundance, but chicks 
also required larger insects as they grew. Thus, the consum- 
er’s population success depends not just on synchrony with 
its prey, but also on overlap between individuals’ changing 
resource needs and the quantity and quality of available 


resources. —BEL 
Ecology 10.1002/ecy.3743 (2022). 


IMMUNOLOGY 
A Pten-tial cause of 


autoimmunity? 

Although platelets are mostly 
known for their role in clotting, 
they also have immunoregula- 
tory functions. Hyperactive 
platelets are associated with 


several autoimmune diseases. 
Chen et al. report that mice with 
hyperactive platelets develop 
severe age-related autoimmune 
and lymphoproliferative disease. 
Platelet-specific deletion of a 
gene called phosphatase and 
tensin homolog (Pten) results in 
excessive platelet aggregation 
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and secretion of cytokines that 
promote the development of T 
follicular helper (Tfh) cells. Tfh 
cells support B cell antibody 
production in lymphoid ger- 
minal centers. Future studies 
will be needed to test whether 
platelet-mediated enhancement 
of Tfh cell activity is a hallmark 
of any autoimmune diseases in 
humans. —STS 

Nat Commun 13, 2762 (2022). 


MICROBIOME 


Microbiome and statins 
Statins, or HMG-CoA reductase 
inhibitors, are cholesterol- 
reducing drugs that are among 
the most prescribed medica- 
tions. Their effectiveness and 
side effects are very heteroge- 
neous. Human gut bacteria are 
known to metabolize statins in 
vitro, and their prevalence is 
also heterogeneous. In people 
prescribed known doses of 
statins, Wilmanski et al. showed 
that blood concentrations 
of the metabolite 3-hydroxy- 
3-methylglutarate-coenzyme-A 
(HMG), the natural substrate 
for HMG-CoA reductase, is 
potentially a serum biomarker 
for statin responses. Further, 
shotgun sequencing of stool 
samples found that people on 
statins who were enriched for 
Bacteroides gut bacteria tended 
to have greater risk of metabolic 
disruption than those with a 
preponderance of Firmicutes 
bacteria. —YN 

Med 10.1016/j.medj.2022.04.007 

(2022). 


NEURONAL CELL BIOLOGY 


An axonal laddER 


Neuronal axons contain their 
own complement of cyto- 
plasmic organelles, including 
endoplasmic reticulum (ER). 

It is challenging to explain how 
ER tubules are packed into the 
elongated axon of a neuronal 
cell. Zamponi et al. examined 
the ER in developing mam- 
malian axons at nanometer 
resolution. The axonal ER inter- 
calates and engages in close 
contacts with individual micro- 
tubules to make the “rungs” 

of ladder-like structures. In 
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developing neurons, the ER 
axonal ladder acts as a reservoir 
for key components of the cal- 
cium-handling machinery. The 
ER-shaping protein Reticulon 2 
(Rtn2) influences the archi- 
tecture and dynamics of the 
ladder by linking ER tubules to 
kinesin motors on microtubules. 
Mutant Rtn2, which is linked to 
hereditary spastic paraplegia, 
disrupts the ladder by compet- 
ing for kinesin binding, and thus 
impairs ER trafficking in axons. 
—SMH 
Dev. Cell 10.1016/ 
j.devcel.2022.05.002 (2022). 


2D MATERIALS 
Excitons confined 


Using electric fields to confine 
neutral objects such as excitons, 
which consist of an electron and 
a hole, can be tricky. Thureja et 
al. used the Stark effect, which 
is the shifting of energy levels in 
the presence of electric fields, 
to engineer the confinement of 
excitons in monolayer molybde- 
num diselenide. The researchers 
applied an inhomogeneous 
in-plane electric field, which 
induced a permanent dipole 


moment for excitons, creating 
a potential minimum at the 
position of the field maximum. 
The repulsive interaction of 
excitons with free carriers 
further contributed to the con- 
finement. This resulted in the 
quantization of exciton motional 
states, which was detected by 
measuring the sample's optical 
response. The technique may be 
useful in applications, as well as 
in fundamental studies. —JS 
Nature 10.1038/ 
$41586-022-04634-z (2022). 


CHEMISTRY 


N with four phenyls 

Salts with nitrogen bound to 
four carbons are extremely 
common and find use ina 
variety of cleaning products. 
Surprisingly, however, the 
simple cation in which all four 
substituents are phenyl rings 
has eluded full characterization. 
The chief problem has been that 
the triphenylamine precursor 

is not sufficiently reactive to 
add the fourth ring. Fujita et al. 
surmounted that problem by 
appending (and later remov- 
ing) tert-butyl substituents to 


the rings and applying radical 
coupling to form the fourth 
nitrogen—carbon bond. The 
cation, paired with a borate, 
crystallized with the rings 
canted in S, symmetry. —JSY 
Nat. Commun. 10.1038/ 
$41467-022-30282-y (2022). 


OZONE HOLE 


A bumpy road 
The Antarctic stratospheric 
ozone hole has gradually been 
shrinking due to the decrease 
in anthropogenic emissions of 
ozone-depleting substances. 
However, in 2020 and 2021, it 
grew again to sizes among the 
largest on record. What was 
the reason for this? Yook et al. 
suggest that its growth was the 
result of the injection of anoma- 
lously large amounts of aerosols 
into the stratosphere by two 
episodic events: the Australian 
bushfires of 2020 and the volca- 
nic eruption of La Soufriere on 
Saint Vincent in 2021. Southern 
Hemisphere surface climate also 
seems to have been influenced 
by these ozone holes. —HJS 
Geophys. Res. Lett. 
10.1029/2022GL098064 (2022). 


The 2021 eruption of the La Soufriere volcano on the Caribbean island of Saint Vincent helped to increase 
the size of the Antarctic ozone hole later that year. 
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INTRODUCTION: The centrosome is an inter- 
action hub composed of two centrioles sur- 
rounded by pericentriolar material that 
collectively exerts many pancellular functions, 
such as cell division, cell migration, and cilia 
formation. The centrosome acts as the main 
microtubule-organizing center (MTOC) in many 
cells, including stem and progenitor cells, but 
loses this activity often during differentiation. 
Very little is known, however, about the extent 
of its cell type-specific composition and func- 
tion. Individual proteins have been found to 
be specific to the centrosome of, for example, 
neural stem cell subtypes, but whether these 
are exceptions or the rule is unknown. 


RATIONALE: To assess any potential cell type- 
specific functions of the centrosome, its com- 
position needs to be further investigated. 
However, no comprehensive proteome of neu- 
ral centrosomes exists to date, and hence, the 
differences in centrosome composition be- 


Neural centrosome 
proteome identifies dis- 
ease candidates. Spatial 
proteomics of human 
neural stem cell and 
neuronal centrosomes 
uncovers cell type- 
specific protein hubs. 
Overlapping the pro- 
teomes with de novo 
mutations identified in 
patients with neurodevel- 
opmental diseases 
revealed cell type-specific 
disease associations, 
enabling prioritization of 
disease variants. Among 
those, the expression of 
the PH-associated mutant 
R23W [in which arginine (R) 
at position 23 is replaced 
with tryptophan (W); red] 
PRPF6 (blue) recapitulated 
the periventricular cellular 


Neurons 
misplacement in the developing mouse brain by missplicing of brain-specific serine/threonine kinase 2 (Brsk2). 


O'Neill et al., Science 3'76, 1286 (2022) 


Neural stem cells 


17 June 2022 


tween neural and other cell types are unknown. 
Likewise, the extent of the changes in this 
organelle’s distinct makeup during the differ- 
entiation of neural stem cells to neurons has 
not been explored. Because centrosome dys- 
function is also linked to many neurodevel- 
opmental conditions, information from such 
analysis could identify yet unknown disease 
associations. 


RESULTS: To map the centrosome proteome 
of human neural stem cells and neurons, we 
chose a spatial proteomic approach to identify 
not only which proteins are present at this 
organelle but also where they are localized. 
Specifically, we selected 10 bait proteins known 
to localize to distinct sites of the centrosome, 
immunoprecipitated them from induced plu- 
ripotent stem cell-derived neural stem cells 
and neurons, and reproducibly determined 
their interactome with mass spectrometry. 
Interrogation of their interacting partners 


Brsk2 
=! —_t- ay] 
WT 
| i. PRPF6 R23W 
Periventricular Brsk2419 


heterotopia 


9 <e 


revealed diversity at this organelle, in which 
around 60% of the centrosome proteins had 
not yet been detected at the centrosome in 
other cell types. Furthermore, upon neuronal 
differentiation, more than half of these pro- 
teins become exchanged for new interactions 
at specific localizations within the centrosome. 
The neural centrosome proteomes comprise 
significantly enriched Gene Ontology terms 
of RNA-interacting proteins that were not 
observed in other cell types. Overlapping the 
neural stem cell and neuron centrosome pro- 
teomes with gene variants observed in patients 
with neurodevelopmental conditions of un- 
known etiology highlights specific and signif- 
icant enrichment in epilepsy patients for the 
neuronal and, in periventricular heterotopia 
(PH), for the neural stem cell centrosome pro- 
teome. With respect to PH, we explored the 
effect of one candidate variant within the ubiq- 
uitously expressed gene that encodes the pre- 
mRNA processing factor 6 (PRPF6). We chose 
this candidate because several members of the 
PRPF6 complex were detected at the neural 
stem cell centrosome and had variants asso- 
ciated with PH. We show that the specific 
mutation of PRPF6 recapitulates aspects of the 
disease phenotype with ectopic cell localization 
in the periventricular region of the developing 
mouse cortex. Expression of the mutated form 
of PRPF6 results in misregulated splicing of, 
among others, the microtubule-associated pro- 
tein kinase Brsk2. Coexpression of the correctly 
spliced form—but not the misspliced form, 
which lacks exon 19—with the mutant PRPF6 
rescued the aberrant cell accumulation at the 
ventricle. The localization of Brsk2 mRNA at 
the centrosome is consistent with a role for 
PRPF6 in bringing its splicing targets to the 
centrosome for local translation and fine 
tuning of microtubule function at the centro- 
some for proper migration out of the peri- 
ventricular region. 


CONCLUSION: Centrosome composition differs 
between cell types, offering a diversity that is 
important for development and disease. The 
ubiquitously expressed protein PRPF6 is enriched 
at the centrosome in neural stem cells but not 
neurons, which causes, when mutated, a PH- 
like phenotype. The extensive characterization 
of centrosome proteins unraveled in this study 
provides a rich resource with which to explore 
further disease associations and cell type- and 
stage-specific functions. 


The list of author affiliations is available in the full article online. 
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The centrosome provides an intracellular anchor for the cytoskeleton, regulating cell division, cell migration, 
and cilia formation. We used spatial proteomics to elucidate protein interaction networks at the centrosome of 
human induced pluripotent stem cell-derived neural stem cells (NSCs) and neurons. Centrosome-associated 
proteins were largely cell type-specific, with protein hubs involved in RNA dynamics. Analysis of 
neurodevelopmental disease cohorts identified a significant overrepresentation of NSC centrosome proteins 
with variants in patients with periventricular heterotopia (PH). Expressing the PH-associated mutant 
pre-mRNA-processing factor 6 (PRPF6) reproduced the periventricular misplacement in the developing 
mouse brain, highlighting missplicing of transcripts of a microtubule-associated kinase with centrosomal 
location as essential for the phenotype. Collectively, cell type-specific centrosome interactomes explain 
how genetic variants in ubiquitous proteins may convey brain-specific phenotypes. 


he centrosome acts as a hub for the cyto- 
skeleton and regulates many processes 
in development (J). It is composed of two 
centrioles of differing maturity, called 
the mother and daughter centrioles (2). 
Microtubules are anchored at the more mature 
mother centriole through its subdistal appen- 
dages (3). This feature is central to the function 
of the centrosome as the primary microtubule- 
organizing center (MTOC) in animal cells (4, 5). 
Centrosomal MTOC activity changes during 
development, increasing, for example, in de- 
laminating neural stem cells (NSCs) and de- 
creasing in migrating neurons, a process that is 
regulated by the newly identified centrosomal 
protein formerly named AT-hook-containing 
transcription factor (AKNA) (6). Although cen- 
trosome proteomes have been cataloged for 
cancer cells and Drosophila (7-10), the dynamic 
relationship of AKNA with the centrosome 
highlights the need to comprehensively inves- 
tigate the potential heterogeneity of centro- 
some interactors in brain cells. We identified 
the centrosome proteome of human NSCs 
and neurons, showing their cell type-specific 
relevance to the neurodevelopmental disorder 
periventricular heterotopia (PH). 


Results 
Spatial centrosome proteome of NSCs and neurons 


To investigate the centrosome proteome of 
human NSCs and neurons, induced pluripotent 
stem cells (iPSCs) were differentiated toward a 
dorsal forebrain identity (Fig. 1A) (1D). At day 
15 of differentiation, almost all cells (96.6%) 
were PAX6* NSCs (Fig. 1, B and D, and fig. S1A), 
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whereas neurons reached high purity at around 
day 40 (Fig. 1, C and D, and fig. S1A) and ex- 
hibited known centrosome dynamics, such as 
NINEIN loss from this organelle (Fig. 1, E to G) 
(12). We therefore chose these time points to 
probe the centrosome proteomes of NSCs and 
neurons by using mass spectrometry. 

To inform about the spatial distribution of 
the interactors at the centrosome, we designed 
an affinity purification strategy that targets 
10 different “bait” proteins essential for correct 
centrosome function, each localizing at differ- 
ent regions within this organelle (Fig. 1H) (73). 
In NSC cultures harvested at day 15 from four 
biological replicates, 1401 high-confidence inter- 
actions comprising 751 proteins were identified, 
including many centrosomal proteins from 
curated reference lists and previous studies 
(14-18), thus underscoring the robustness of 
the approach (Fig. 1H; figs. S1, D to F, and S2H; 
and table S1). We detected 480 proteins that 
were not allocated to the centrosome in previ- 
ously studied cell types (Fig. 11). As expected, 
the NSC centrosome proteome is enriched 
for Gene Ontology (GO) terms related to cell 
division and microtubule organization, among 
others (Table 1). However, among the highly 
significant GO terms (P values are provided in 
Table 1) were also mRNA processing, splicing, 
and metabolism, which were not present in 
previous centrosome datasets analyzed in the 
same manner (Table 1 and table S4). Over- 
lapping protein-protein interaction networks 
of multiple baits can inform on spatial dis- 
tribution and organellar dynamics (79). We 
therefore clustered the protein interactions for 
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these 10 bait proteins within a force-directed 
layout by use of Cytoscape. Bait-prey positions 
within the network are dependent on their 
common interactions with other bait proteins, 
as shown in the spatial projection (Fig. 1, L 
and M). This revealed enrichment of RNA- 
interacting proteins at specific baits, including 
the subdistal appendage proteins centrosomal 
protein of 170 kDa (CEP170) and outer dense 
fiber of sperm tails 2 (ODF2) (Fig. 1M). Cen- 
trosome localization of these RNA-interacting 
proteins was not dependent on microtubules 
because they persisted in the centrosome 
interactome of NSCs after treatment with 
microtubule-depolymerizing nocodazole (fig. S3 
and tables S1 and S83). Thus, the centrosome 
interactors detected in NSCs may shed light on 
brain-specific functions at the centrosome. 

To ask whether these interactions were 
brain-specific or NSC-specific, we applied 
affinity purification of the same 10 bait proteins 
in neurons, collected at day 40 of iPSC dif- 
ferentiation (Fig. 1, C and D). This revealed 
786 proteins enriched at the centrosome in 
neurons (Fig. 1, H and J, and tables S2 and S3), 
with about half of the centrosome interactome 
present only at one stage—59% in neurons 
and 57% in NSCs (Fig. 1K). Of these, the 
majority (64 and 57% in NSCs and neurons, 
respectively) were not present in other centro- 
some datasets (Fig. 1, I to K). RNA-related 
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Fig. 1. Spatiotemporal profiling of the neural centrosome interactome. 
(A) Schematic overview of the study design. (B to G) Immunostainings of 
human iPSC-derived cells at the stages indicated for antigens indicated on the 
eft, quantified in (D) and (G). Scale bars, (B) and (C) 50 um; (E) and (F) 

0 um. (H) Schematic representation of the mammalian centrosome with 
the position of the 10 bait proteins indicated, informed by (13, 64), and 

the number of interactors (n) in NSCs (blue) and neurons (yellow). 


(I to K) Comparison of the iPSC-derived (1) NSC and (J) neuron centrosome- 
interactome, with the pooled human centrosome protein list derived from 
curated databases (14, 17, 18) and previously published BiolD screens (15, 16) 
and (K) with each other. (L and M) Force-directed bait-prey interactome of 
NSCs, with (L) previously unidentified interactors [not found in the datasets 
in (I) to (K)] (green) and (M) proteins associated with splicing and RNA 
export-related GO terms (red) highlighted. 


functions, such as RNA localization or RNA 
metabolic processes, remained the top GO 
terms in both neural proteomes (Table 1 and 
tables S4 and S8), with RNA splicing selectively 
enriched in the NSC centrosome proteome 
(Table 1 and figs. S2, A to F), comprising a com- 
plex of pre-mRNA-processing factor 6 (PRPF6), 
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apoptotic chromatin condensation inducer 1 
(ACIN1), DEAD-box helicase 23 (DDX23), and 
protein virilizer homolog (VIRMA/KIAA1429). 

Visualization of the spatial centrosome inter- 
actomes shows that changes during neuronal 
differentiation are bait-specific (Fig. 2A; speci- 
ficity of baits at the centrosome is provided in 
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fig. S4). Most interactors lost during differen- 
tiation (significantly enriched at the centro- 
some in NSCs, but no longer in neurons) are 
associated with the baits ODF2 and CEP170 
at the subdistal appendages and the baits 
CDK5 regulatory subunit associated protein 
2 (CDK5RAP2) and centrosomal protein of 
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Table 1. GO enrichment for this and previous centrosome databases. Numbers indicate the false discovery rate (FDR) for each term in each dataset 
indicated (stringency cutoff, 5%). Terms are sorted in ascending order of the FDR difference between NSC and neurons. Complete lists of GO terms 


are provided in tables S4 and S8. 


GO identifier 
GO:0008380 


G0:0006413 


*The notable terms for either cell type. 


192 kDa (CEP192) associated with the pericen- 
triolar material (Fig. 2A). This fits with the 
known loss of centrosome MTOC activity during 
neuronal differentiation (6, 20), the reduction of 
CEP170 at the centrosome during cell differen- 
tiation, and the role of CEP192 in controlling the 
balance of centrosomal and noncentrosomal 
MTOC (2]-23). Centrosome interactors gained 
in neurons were often associated with centroso- 
mal protein of 63 kDa (CEP63), forming in- 
teractions with the actin network and included 
RNA-interacting proteins enriched at differ- 
ent baits (Fig. 2B and fig. S2) as compared 
with those in NSCs (Fig. 1M). 

Although these dynamic changes imply con- 
fidence in the selectivity of the centrosome 
interactors, we further probed this by compar- 
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ing with the total cellular proteome (24, 25) 
of NSCs and neurons differentiated from the 
same human iPSC line by using the protocol 
described above. Most of the proteins detected 
as significantly enriched at the centrosome in 
neurons, but not NSCs (or vice versa), were not 
regulated between these cell types within the 
total proteomes, including proteins further 
highlighted in this study (fig. S2, I and J). The 
overall abundance of bait proteins did not 
change between NSCs and neurons either, 
with the exception of CEP170 and Centrobin 
(CNTROB), which are higher in neurons (fig. 
$2J), but their number of interactors was re- 
duced in neurons or remained the same, re- 
spectively (Fig. 1D). Consistent with the lower 
number of interactors of CEP170 (fig. SID), its 
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centrosomal association has been shown pre- 
viously to decrease during differentiation 
(23), and we also found reduced levels at the 
centrosome by means of immunostaining (fig. 
S4). Overall, these data corroborate the spe- 
cificity of the centrosome enrichment in dif- 
ferent cell types. 

Because the above data suggest neural cell- 
type specificity of centrosome-interacting pro- 
teins with a preponderance to RNA binding 
and RNA-processing factors in both neural cell 
types, we next validated sets of those with im- 
munostaining (Fig. 2, C to H, and fig. S5, K to Q) 
or Western blotting after coimmunoprecipitation 
with the respective bait proteins (Fig. 2, I to K, 
and fig. S5, A to J). The centrosome associ- 
ation of the exon-junction proteins (MAGOH 
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Fig. 2. Cell type-specific RNA-processing proteins at the centrosomes. 

(A) Combined view of the force-directed bait-prey interactomes of the NSC and 
neuron centrosomes. (B) Proteins associated with splicing and RNA export- 
related GO terms (red) at the neuronal centrosome (NSCs are provided in Fig. 1M). 
(C to H) Immunostainings confirming the localization of selected RNA binding 
proteins at the centrosome in human iPSC-derived NSCs at day 16. White 
dashed boxes outline colocalization of the proteins indicated in green, with 
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the centrosomal markers in magenta shown to the right in higher magnification. 
Scale bars, 2.5 um. (I to K) Coimmunoprecipitation with bait proteins 
followed by Western blotting of the indicated preys to validate the liquid 
chromatography—mass spectrometry (LC-MS)/MS findings. [(I) and (J)] FMRP 
and FXR2P were pulled down by bait proteins POC5 and CEP135 at day 15, and 
(K) AGO1 was pulled down by CEP135 at day 35. Further confirmations are 


Table 2. Overlapping neurodevelopmental disease cohorts and centrosome proteomes. Shown is analysis of the de novo variants per disease gene 
set per protein list, assessed by means of exact binomial test with Benjamini-Hochberg correction. P values were calculated by means of exact binomial test 
(two-tailed) with Benjamini-Hochberg correction (FDR 0.05). ASD, autism spectrum disorder; PH, periventricular nodular heterotopia; !D, intellectual disability; 


EE, epileptic encephalopathy; PMG, polymicrogyria. 


Centrosome = NSCs microtubule-independent e 
datasets (n = 3165) NSCs (n = 751) (n = 625) Neurons (n = 786) 
Disease Expected Observed Expected Observed Expected Observed Expected Observed 
P vali P val P valu P value 
gene-set* events events events events events events events events 


PMG (n = 86) 


*Number of individuals (n). Significant P values. 


and RBM8A), RNA binding protein Roquin-1 
(RC3H1), translation regulators FMRP and 
FXR2P, RNA processing complex member 
AGOI, and the nucleoporin NUP50 (Fig. 2, C to 
K, and fig. S5) [other nucleoporins at the cen- 
trosome are available in (26)] was confirmed 
in cultured cells and human fetal cortex sam- 
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ples (fig. S5, I and J). For the latter, we chose 
gestational week 18 as a later stage of cortex 
neurogenesis, with many neurons still migrat- 
ing, which would be most comparable with the 
stages analyzed in vitro. Thus, three sets of 
analyses confirm the reliability and specificity 
of our centrosome interactome analysis. 
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Significant overlap with specific 
neurodevelopmental disease cohorts 

We next asked whether these neural proteome 
datasets could be used to inform on genetic 
variants of unknown etiological relevance in 
individuals with neurodevelopmental disease. 
The proteins identified in our centrosome 
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as 
Table 3. PH-associated de novo variants within individual bait interactomes. Analysis of de 
novo variants for PH gene-set within the proteome of individual baits in NSCs, assessed by means of 
exact binomial test with Benjamini-Hochberg correction (FDR 0.05). 


NSC, all 


Bait protein P value 


0.0107* 


POC5 (n = 106) 


*Significant P values. 


proteomes and other publicly available cen- 
trosome interactors (14-18) were overlaid 
with genes harboring rare de novo variants 
(DNVs) identified in patients with various neuro- 
developmental disorders that still await genetic 
diagnosis (27-34). Comparing the overlap of 
the centrosome proteomes with neurodevel- 
opmental disease cohorts identified several 
significant overlaps (Table 2 and table S5) 
that are beyond that expected from natural 
genetic variation (35). First, we observed that 
autism spectrum disorder (ASD) DNVs showed 
significant enrichment in all centrosome data- 
sets, supporting pancellular centrosome pro- 
teins in disease etiology. Another significant 
association was observed between DNVs in 
patients with intellectual disability (ID) and 
both published centrosome datasets and our 
neuronal centrosome proteome. Because neu- 
rons do not divide, neuronal centrosomes may 
be particularly relevant for ID owing to their 
role in cilia formation and function. Conversely, 
only the NSC centrosome proteome was sig- 
nificantly enriched for proteins encoded by 
loci with DNVs in the PH cohort databases 
(Table 2). The failure of some cells to move 
away from the ventricular lining in PH (36) 
may relate to the centrosomal MTOC activity in 
NSCs mediating delamination of cells from the 
ventricle (6, 37). Consistent with this hypothe- 
sis, the majority (88%) of the NSC centrosome 
proteins with DNVs in PH were associated 
with baits located at microtubule-anchoring 
centrosome positions (Table 3 and table S5). 
Almost all (15 of 16) of these proteins driving 
the PH association were interacting with the 
centrosome in a microtubule-independent 
manner (still present in the nocodazole-treated 
condition) and hence are direct centrosome 
interactors. Taken together, these data sug- 
gest a link between our neural centrosome 
data and specific neurodevelopmental dis- 
eases, with proteins of the NSC and neuro- 
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NSC, microtubule-independent 


Bait protein 


Centrobin (n = 3) 


POCS (n = 86) 


nal centrosome proteome enriched in distinct 
disease cohorts. 


PRPF6 variant recapitulates aspects of PH 


To determine whether centrosome association 
of certain proteins indeed helps prioritize DNVs 
with functional relevance, we investigated the 
dynamic enrichment of PH DNVs within the 
centrosome proteomes. Among the microtubule- 
independent NSC centrosome interactors with 
PH DNVs, we found four members of an RNA- 
processing complex: ACIN1, DDX23, KIAA1429, 
and PRPF6 (Table 2 and tables S1 and S3). 
Members of this complex were significantly 
enriched within a set of 40 candidate PH genes 
whose expression patterns mimic those of 
known PH loci within human brain transcrip- 
tomic data, supporting a relationship to the 
disease (fig. S6, A and B, and table S6). This 
prompted us to focus our analysis on the ubiq- 
uitously expressed protein PRPF6 because its 
centrosomal localization along with its asso- 
ciated PH interactors may explain how mutations 
in this complex induce neurodevelopmental 
phenotypes. As predicted by the proteome 
analysis and confirmed with down-regulation, 
PRPFG is enriched at the centrosome of NSCs 
and binds centrosomal and RNA-interacting 
proteins (Fig. 3, A to F, and fig. S6, E to H). 
Affinity purification of the RNA binding pro- 
tein PRPF6 within the human iPSC-derived 
NSCs pulled down 297 proteins, of which 
111 were shared with centrosome proteome 
(tables S1 and S7), and included a protein 
complex significantly enriched for genes with 
DNVs in patients with PH (fig. S6C). This 
reinforces the plausibility of a contribution of 
PRPF6 centrosomal localization to the dis- 
ease phenotype caused by this otherwise ubiq- 
uitous protein. 

The DNV in PRPF6 was identified in a male 
patient born from healthy nonconsanguineous 
parents, was diagnosed with delayed develop- 
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mental milestones, and had experienced a single 
convulsive seizure at 3 years of age. He had 
severe ID and was nonverbal. Head circum- 
ference was at the seventh percentile; brain 
magnetic resonance imaging (MRI) showed 
bilateral PH with mildly enlarged Sylvian fis- 
sures, and mild frontal lobe and cerebellar hy- 
poplasia. Specifically, the patient has a single, 
rare (not observed in large genomic sequence 
datasets) de novo missense variant [c.67C>T, 
p.Arg23Trp; RefSeq NM_0124.69.4 (GRCh37)] 
localized in the Prp1 domain of PRPF6 that 
directly targets RNA for splicing (38). The 
variant is predicted to be deleterious on the 
basis of the high Polyphen score and low re- 
sidual variation intolerance score (32). 

In the developing mouse brain, Prpfé is 
expressed in both neurons and progenitors 
(fig. S6D), which is consistent with its overall 
ubiquitous expression (39). Following previ- 
ous modeling of PH in the developing mouse 
brain (40-42), we used in utero electroporation 
(UE) to introduce constructs expressing either 
control [green fluorescent protein (GFP)], wild 
type (PRPF6™"), or PRPF6®”? mutant [in 
which arginine (R) at position 23 is replaced 
with tryptophan (W)] into the mouse cortex 
at embryonic day 13 (E13) (fig. $7, A to C). 
Analysis at 3 days after electroporation (at 
E16) showed significantly more GFP* cells 
expressing PRPF6***™ in the periventricular 
area (Bins 1 and 2, comprising the ventricular 
and subventricular zones, respectively), with 
fewer cells reaching the neuronal layers in the 
cortical plate (Bins 4 and 5) relative to the cells 
expressing the wild-type form (fig. S7, A to 
D). Most of the cells expressing PRPF6*?°” 
that were stuck in the subventricular zone 
succumbed to cell death (fig. $7, Eto H), and 
by 5 days after electroporation at E18, most 
GFP* cells had reached the outer bins in all 
three conditions (Fig. 3, G to J). However, a 
significantly increased fraction of cells ex- 
pressing PRPF6"?3” remained located at the 
periventricular area (Fig. 3, I and J) in a pattern 
reminiscent of the heterotopia in PH patients. 
Although this phenotype may not reflect all 
aspects detected in human patients, the finding 
of only a minority of cells placed ectopically in 
periventricular regions, whereas most made it 
into a normal-appearing grey matter, reflects a 
common hallmark in PH. 

Immunostainings for the NSC marker PAX6, 
the progenitor marker TBR2, and the neuronal 
marker TBR1 revealed the mixed composition 
of the periventricular GFP* cells in all three 
conditions at E18 (Fig. 3, K to S). Most were 
PAX6* (Fig. 3, K to M and Q), many were TBR2* 
(Fig. 3, K to M and R), and some were TBR1* 
(Fig. 3, N to P and S). However, the proportion 
of PAX6* NSCs was significantly decreased, 
whereas neurons were increased in the 
PRPF6"™” condition (Fig. 3, Q and S). Thus, 
deficits in delamination and/or migration 


5 of 10 


RESEARCH | RESEARCH ARTICLE 


ao 
A BY Cc D 3 E < 
. 3 a sr s 3 
2 E 70 aa -~ OM ws a 
9 2 =£ OY £m. ££ SM ws 
— 60 
8 = 100- PRPFG6 —-250-}agmmeeme | KIAA1429  100- - PRPF6 
5 2 50 : = 
os) o 4100-| ACIN4 100-| LEPRE1 5 Zz 
2 a 40 oa 
5 f 100-\—~ WR [KiAA1429 100-\qge= | RBMA5 
= ‘5 30 100- PRPF6 
® Centr. No-col 
aN 
G H J K N pCAG_GFP 
pCAG_GFP pCAG_PRPF6"" pCAG_PRPF62” ye ee ee 
a c 
ie) ke [4 
ie © a 
CO Ree —i‘“(‘(‘éi «A ee, eee 
Z : 
bl ° o e) 
£ . rs 
re a ) 
(U) ry L pCAG_PRPF6 O 
© c N 4 
nT oo fad IZ Ps 
ud co Zz 
bad n pCAG_GFP = i) Bi 
ui a G=ipCAG_PRPF6™ 75 Baia 
ll pCAG_PRPFe™w  & u 
= L 2 
£ oy ---: 
a 


pDCX_GFP 


Vv 
pDCX_PRPF6"" pDCX_PRPF6"2" 


0 10 20 30 40 50 60 


GFP+ cells at 5 dpe (%) 


GFP PAX6 TBR2 


pDCX_GFP 
[=] pDCX_PRPF6"™* 
I pDCX_PRPF6"2" 


Bin1 Bin2 Bin3 Bin4 Bin5 


10 20 30 40 50 60 70 


P 


GFP TBR1 


” 


% TBR1* GFP* 


20 
15 

104 , 

5 e 

0 


Fig. 3. Centrosomal PRPF6 and its role in PH. (A) E14 mouse cortical cells 

at 3 days in vitro stained as indicated, and (B) colocalization quantified (n = 300 
in three independent replicates indicated as mean + SD). Scale bars, 2.5 wm. 
(C to F) Coimmunoprecipitation (immunoprecipitation indicated at top) followed 
by Western blot (antibodies indicated at right) from day 15 iPSC-derived NSCs. 

(G to I, K to P, and T to V) Coronal sections of E18 mouse cerebral cortices 
electroporated at E13 with [(G), (K), (N), and (T)] GFP-only control, [(H), (L), 
(0), and (U)] PRPF6™", or [(1), (M), (P), and (V)] PRPF6°2S under [(G) to (I) and 


(K) to (P)] CAG or [(T) to (V)] doublecortin promoter, im 
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rather than a failure to differentiate seem to 
be involved in the periventricular cell posi- 
tioning. Therefore, we aimed to determine 
whether the ectopic positioning would also 
occur when GFP, PRPF6™", and PRPF6®”?” 
were expressed only in young neurons and 
differentiating progenitors under the double- 
cortin regulatory elements (6). IUE at E13 
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followed by analysis at E18 showed no sig- 
nificant difference in the distribution of GFP* 
cells for any of the conditions and no ectopic 
cells in the lower bins (Fig. 3, T to W), sug- 
gesting that placement of cells expressing 
mutant PRPF6 in the periventricular region 
occurs at earlier stages, before neuronal differ- 
entiation. This finding is in agreement with the 


2022 


preferential interaction of the PRPF6 splicing 
complex with the NSC compared with the neu- 
ronal centrosome. 


Correctly spliced Brsk2 rescues 
PRPF6-induced PH 


To better understand the etiology of this phe- 
notype, we explored the role of PRPF6 as a 
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Fig. 4. PRPF6"25™ affects splicing in the PH phenotype. (A) Schematic 
representation of the experimental protocol. (B) Summary of the splicing 
changes (indicated as numbers) in cells expressing PRPF6***"’ versus PRPF6"”. 
SE, skipped exon; RI, intron retention; MXE, mutually exclusive exon; A5SS, 
alternative donor site; A3SS, alternative acceptor site. PSI; percent spliced-in. 
(C) GO analysis (biological processes) of genes with SE or RI in PRPF6*@” 
versus PRPF6"'-expressing cells. (D) Quantification of the number of genes 
differentially spliced and preferentially expressed in the indicated regions. 

P values with scale are shown on right y axis as red bars (Fisher's Exact test, 
two-tailed, with Benjamini-Hochberg correction). (E) Exons (boxes) encoding the 


regulator of the spliceosome machinery (38, 43). 
To do so, we performed RNA-sequencing on 
flow cytometry-purified GFP* cells at 1 day 
after electroporation (at E14), before any 
phenotype could be observed (Fig. 4A and 
fig. S8, A to D). Only two genes (VCAMI1 and 
a collagen) were differentially expressed be- 
tween PRPF6™’- and PRPF6*”*”-expressing 
cells (fig. SSE). Using the mixture-of-isoforms 
(MISO) statistical model, which assigns a 
“percentage spliced in” (PSI) value to each 
splicing event (44), and choosing the stringent 
Bayes factor >5, a total of 182 alternative splice 
events in 166 separate genes were found to be 
significantly changed between PRPF6™" with 
PRPF6"”?” GFP* cells (Fig. 4B). These changes 
encompassed all types of alternative splicing 
events: 101 alternative cassette exons, 37 intron 
retention events, five mixed spliced events, as 
well as 13 and 26 events for alternative donor 
and acceptor sites, respectively (Fig. 4B). Cells 


expressing PRPF6*”?” showed a bias toward 
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(lines). (F) Regiona 


two categories: cassette exon-skipping and 
intron retention (Fig. 4B), as validated with 
quantitative reverse transcription polymerase 
chain reaction (RT-PCR) (fig. S8F). This is con- 
sistent with the role of PRPF6 as a core splic- 
ing component. 

GO term analysis for the genes identified 
with skipped exons (SEs) or retained introns 
(RIs) revealed enrichment for categories gov- 
erning central nervous system development 
and cell-cell adhesion among SE genes, whereas 
RI genes were enriched for chromatin silenc- 
ing and RNA metabolic processing (Fig. 4C). 
To probe when the genes misspliced upon 
PRPF6*?” expression may have the greatest 
effect, we examined their expression using 
data from the developing mouse cortex (45). 
This showed that genes with skipped exons 
induced by expression of PRPF6*?°” were 
enriched for loci with significantly greater ex- 
pression during migration (Fig. 4D). To prior- 
itize possible candidate genes mediating the 
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Brsk2_202 transcript isoforms (red: skipped exon in PRPF6"@>"’ cells), introns 
expression of Brsk2 isoforms (23). (G to I) Coronal sections of 
E18 mouse cortices coelectroporated at E13 as indicated at top. (J) Quantification 
of (G) to (I) (n = embryo; mean + SD; unpaired two-tailed Kruskal-Wallis test 
followed with Dunn's multiple comparison; *P < 0.05). (K and L) Single-molecule 
FISH (magenta) and immunostaining in embryonic mouse cortical cells 

(3 days in vitro). The white dashed boxes are expanded in the bottom insets. 

(M) Quantification of (K) and (L) (n = 300 cells from three independent cultures; 
mean + SD; unpaired one-tailed Mann-Whitney test; * P < 0.05). Scale bars, (G) to 
(I) 100 um; (K) and (L) 2.5 um. Abbreviations are as in Fig. 3. 


PH phenotype, we combined the two main en- 
richment analyses from Fig. 4, C and D, which 
identified Ctip2 and Brsk2. We selected Brsk2 
because it encodes the SAD-A kinase phos- 
phorylating microtubule-associated proteins 
(MAPs), regulating microtubule dynamics 
(46, 47) and neuronal migration in the de- 
veloping cerebral cortex (48). 

Of the three Brsk2 isoforms expressed within 
the developing mouse brain (23), exon 19 of 
isoform Brsk2_202 [RefSeq NM_001009930.3 
(GRCh37)] is skipped in the mutant condition 
(Fig. 4E and fig. S8L). Brsk2_202 is expressed 
in cells that leave the ventricle, whereas the 
isoform Brsk2_201 [RefSeq NM_001009929.3 
(GRCh37)] is expressed in all zones [data are 
from (23)] (Fig. 4F). To test whether Brsk2_202 
plays a role in mediating exit from the ven- 
tricular region, PRPF6®”°” and Brsk2_202 
were coelectroporated at E13. This resulted in 
correct cellular distribution within the devel- 
oping cortex 5 days after electroporation (at 
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E18) (Fig. 4, G to J), whereas coelectroporation 
of the misspliced Brsk2_202 lacking exon 19 
(Brsk2_202“"°) did not rescue the PRPF6*??” 
periventricular phenotype (Fig. 4, I and J). 
These data implicate the deficiency of this iso- 
form in cells failing to leave the periventricular 
region and link microtubule-associated pro- 
cesses in migration out of the periventricular 
region in causing PH phenotypes. 

These findings prompted the question of 
whether NSC centrosome-associated proteins 
in the highest enrichment category, “splicing,” 
bring their target RNAs to the centrosome. 
Splicing normally takes place in the nucleus, 
but the dynamic centrosome association of 
the PRPF6 complex (which includes ACIN1, 
DDX23, and KIAA1429 as well as exon junction 
complex proteins) suggests that RNA process- 
ing, transport, and/or translation modulation 
may be locally regulated by PRPF6. Consistent 
with this, we detected Brsk2 RNA by means of 
single-molecule fluorescence in situ hybridiza- 
tion (FISH) and high-resolution imaging in the 
proximity of the centrosome in 28.3% of mouse 
embryonic cortex cells compared with control 
Polr2a RNA (9.3%) (Fig. 4, K to M). RNAs 
encoding centrosomal proteins have been 
found at the centrosome in a polysome and 
translation-related manner (49, 50). These data 
are consistent with a role of the PRPF6 RNA 
binding and RNA-processing protein complex 
at the centrosome, shedding light on howa 
mutation of this ubiquitous protein causes a 
phenotype in the developing brain. 


Discussion 


We used affinity-based proteomics on human 
iPSC-derived NSCs and neurons targeting 
10 core proteins to obtain a spatial portrait 
of the centrosome proteome. This led to the 
discovery of hundreds of neural centrosome 
interactors that were not reported in other 
centrosome proteomes. Further, this work 
uncovered dynamic changes of more than half 
of the centrosome proteome at specific baits 
during neuronal differentiation. Overlaying 
this interactome with DNVs of unknown im- 
portance from distinct neurodevelopmental 
disorders identified an enrichment for variants 
found in individuals with PH within the NSC 
centrosome proteome. This overlap was not 
observed for other cell types, including neu- 
rons profiled with the same method, which 
supports centrosome cell-type specificity to be 
relevant for neurodevelopmental disorders. 
The centrosome localization of interactors was 
not restricted to mitosis—as described, for ex- 
ample, for transcriptional regulators localiz- 
ing to the centrosome or spindle apparatus in 
mitosis (57)—but was rather found in inter- 
phase, like AKNA (6). Significant enrichment 
of RNA binding and RNA-processing pro- 
teins is prominent in the neural centrosome 
proteome, and their disease relevance is high- 
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lighted by the splicing complex formed by 
PRPF6 with ACINI, DDX23, and KIAA1429. 
Modeling the disease contribution of the 
PRPF6 mutation detected in a PH patient, our 
work indicates how ubiquitously expressed 
genes can contribute to specific disease pheno- 
types through differential protein network 
interactions across cell types. 

We report the predominance of RNA binding 
and RNA-modifying proteins, including factors 
involved in mRNA splicing, RNA transport, and 
regulation of translation at the neural centro- 
some proteome, which were not detected in 
other centrosome proteomes. For example, the 
three fragile X syndrome proteins—FXRIP, 
FXR2P, and FMRP—regulate several RNA 
processes, including translation, transport, 
and editing (52-57). Given their link to ASD, 
exploring their centrosomal function in neural 
iPSC-derived cells as well as in fetal tissue could 
elucidate the neurodevelopmental contribution 
to this condition. Roquin-1 is an RNA binding 
protein that mediates degradation of its tar- 
gets and was also detected and validated at 
the neural centrosome, along with its inter- 
actor NUFIP2 (58). The recently shown bind- 
ing of Roquin-1 to Akna RNA (59) would be 
consistent with a role in regulating centro- 
somal MTOC through RNA regulation at the 
centrosome. Specific mRNA transcripts have 
been shown to localize at this organelle (such 
as PCNT) (50, 60, 61), where their local protein 
translation is detected (62). We demonstrated 
that the RNA for a MAP kinase (SAD-A en- 
coded by Brsk2), a splicing target of the PRPF6 
complex, also localizes to the centrosome, ex- 
panding the concept of function of specific 
RNAs at this location. 

The concept of regulating centrosomal MTOC 
activity also through local RNAs is further 
supported by the localization of most of the 
proteins with PH variants at centrosome baits 
of the appendages or pericentriolar material 
(PCM) where microtubules are anchored, in- 
cluding all components of the PRPF6 com- 
plex. Centrosomal MTOC activity has been 
shown to be essential for newly born basal 
progenitors to migrate away from the brain’s 
ventricle (6, 37). The PRPF6 complex interacts 
with the centrosome components involved in 
regulating MTOC, with the de novo PRPF6®”” 
variant identified in a patient with PH in- 
creasing the number of cells remaining in 
the periventricular region (6, 37). Like AKNA, 
PRPF6 localizes to the centrosome during 
interphase and promotes cells’ migrating out 
of the periventricular region. For both proteins, 
this role occurs before neuronal differentiation 
because expression under a neuronal promoter 
failed to elicit a phenotype. The rescue of the 
heterotopia only with the correctly spliced 
form of Brsk2, but not the one lacking exon 19, 
further supports the functional relevance of 
these proteins and their target RNAs at the 
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centrosome for disease. Nascent proteins may 
exert local functions such as phosphorylation 
of dynamic microtubule-associated components 
at the centrosome [reviewed in (63)]. Thus, lo- 
calization of ubiquitously expressed proteins 
from the PRPF6 complex at the centrosome in 
NSCs, but not other cells, correlates with their 
involvement in PH. This not only identifies the 
microtubule-anchoring region of the centrosome 
as a hub for PH disease variants but also 
sheds light on how mutations in genes that 
encode widely expressed proteins can lead to 
disorders restricted to the developing brain. 


Methods summary 
Cell culture 


Cortical NSCs and neurons were differentiated 
from human iPSC lines by using a dual-SMAD 
inhibition protocol (7) with modifications. 
Cellular identity was confirmed with quanti- 
tative RT-PCR and immunostaining. 


Coimmunoprecipitation 


For proteome analysis, cells were harvested at 
days 15 (NSCs) or 40 (neurons) of differenti- 
ation after treatment with dimethy] sulfoxide 
(DMSO) or 3.3 uM nocodazole (NSCs only). 
Cell lysates, each containing 5 mg total pro- 
tein, were incubated for 1 hour with one of the 
10 centrosomal bait antibodies and 2 more 
hours after adding Protein A and Protein G 
Dynabeads, with end-to-end rotation at 4°C. 
Immunoprecipitated lysates were washed with 
lysis buffer, dissociated by boiling in Laemmli 
buffer, and stored at -80°C until mass spec- 
tometry. Using the same procedure, negative 
controls were prepared for each of the four 
replicates parallel to the samples, but bait 
antibodies were omitted. 


Mass spectrometry 


Immunoprecipitates were analyzed with mass 
spectrometry, followed by processing with 
MaxQuant software (1.6.17.0). Protein enrich- 
ment within each inmunoprecipitation was 
calculated with Perseus software (1.6.14.0) by 
using LFQ intensities through unpaired one- 
tailed Student’s ¢ test against the negative 
controls. GO enrichment of the protein lists 
was calculated by using the Search Tool for 
the Retrieval of Interacting Genes/Proteins 
(STRING) database. 


Burden analysis 


Disease set enrichment analyses were carried 
out by using exact binomial test (two-tailed) 
with Benjamini-Hochberg correction as de- 
scribed previously (35), using published de novo 
variants for ASD, PH, ID, epileptic encepha- 
lopathy (EE), and polymicrogyria (PMG) (27-34). 


Immunostaining and single-molecule FISH 


Cortical sections and cells were incubated 
overnight in blocking solution and primary 
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antibody at 4°C. The day after, they were 
stained with secondary antibodies diluted in 
blocking solution and incubated for 1 to 2 hours. 
For single-molecule FISH combined immuno- 
fluorescence, cells were incubated with pri- 
mary antibody in bovine serum albumin (BSA) 
and Triton X-100 in 1x phosphate-buffered 
saline (PBS) and stained with secondary anti- 
bodies as described above. After secondary 
antibody incubation, cells were hybridized 
with RNA probes overnight at 37°C and 
thoroughly washed before embedding. Nuclei 
were visualized by using 4’,6-diamidino-2- 
phenylindole (DAPI). 


Western blot 


Immunoprecipitated samples were ran on 
6 to 12% SDS gels (depending on the protein 
size) and then transferred to nitrocellulose 
membranes. For immunodetection, mem- 
branes were first blocked for 1 hour, incu- 
bated overnight with primary antibodies, and 
then washed three times with 1x tris-buffered 
saline-Polysorbate 20 (TBST) before being in- 
cubated with horseradish peroxidase (HRP)- 
coupled secondary antibodies. The blots were 
visualized by means of the enhanced chemilu- 
minesence (ECL) method, using a ChemiDoc 
instrument. 


IUE 


Endotoxin-free vectors were diluted to 0.5 to 
0.7 ug/l each in 0.9% NaCl and mixed with 
Fast green, and 1 ul of mix was injected into 
the ventricles of embryos at E13 in the uterus 
of anesthetized C57/Bl6 mice and electro- 
porated. Embryonic brains were dissected 1, 
3, or 5 days after electroporation and fixed 
with 4% paraformaldehyde (PFA) in 1x PBS 
for 2 hours (1 day after electroporation), 4 hours 
(3 days after electroporation), or 6 hours (5 days 
after electroporation). For analysis, embryos 
from at least two females were used, and quan- 
tifications were made from two to three coronal 
sections from four to six embryos. Statistical 
differences were assessed by means of unpaired 
Kruskal-Wallis tests followed by Dunn’s mul- 
tiple comparison correction. 
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Control of meiotic chromosomal bouquet and germ 
cell morphogenesis by the zygotene cilium 


Avishag Mytlist, Vineet Kumar}, Tao Qiut, Rachael Deis, Neta Hart, Karine Levy, Markus Masek, 
Amal Shawahny, Adam Ahmad, Hagai Eitan, Farouq Nather, Shai Adar-Levor, Ramon Y. Birnbaum, 
Natalie Elia, Ruxandra Bachmann-Gagescu, Sudipto Roy, Yaniv M. Elkouby* 


INTRODUCTION: Meiosis is a cellular program 
that is essential for producing haploid gametes 
and is conserved in sexually reproducing or- 
ganisms. A hallmark of meiosis is chromosomal 
pairing through synaptonemal complexes and 
meiotic cohesins, which is required for the ex- 
ecution of homologous recombination. How- 
ever, nuclear events of meiosis occur in the 
cellular context of a differentiating gamete, 
where chromosomal pairing depends on cyto- 
plasmic counterparts. In pairing, telomeres 
perform a unique function: They slide on peri- 
nuclear microtubules, shuffling chromosomes 
and mechanically driving their homology 
searches. Telomeres tether to Sun/Kash com- 
plexes on the nuclear envelope (NE), which 
associate with perinuclear microtubules (MTs) 


through dynein. This facilitates telomere rota- 
tions around the NE, which in turn shuffle 
chromosomes, driving their search for homo- 
logs. The perinuclear MTs at these stages em- 
anate from the centrosome MT organizing 
center. Ultimately, rotating telomeres are 
pulled toward the centrosome and cluster 
on this side of the NE while looping their 
chromosomes to the other side, a configura- 
tion called the zygotene chromosomal bou- 
quet. These telomere dynamics are essential 
for pairing and fertility. The bouquet, discov- 
ered in 1900, is universally conserved and was 
proposed as a hub for chromosomal pairing. 
Nevertheless, how cytoplasmic counterparts of 
bouquet formation are mechanically regulated 
has remained enigmatic. 


Zygotene bouquet-stage oocyte. Shown is an oocyte at the zygotene bouquet stage, where chromosomes 
(green) are tethered on the NE through their telomeres, which cluster apposing the centrosome (yellow 
circle), with their free looping ends facing the other side. The bouquet cytoplasmic machinery is shown as 
a cable system composed of the perinuclear MTs (blue cables) that are organized from the centrosome, 
which connects to the zygotene cilium (purple cable). 
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RATIONALE: Here, we report the “zygotene cil- 
jum,” a previously unrecognized cilium, in 
oocytes. Using quantitative three-dimensional 
confocal image analysis, serial block-face scan- 
ning electron microscopy, and live microscopy, 
we characterized in zebrafish the intracellular 
and intercellular organization of the zygotene 
cilium, as well as its developmental stage- 
specific formation during oogenesis. We show 
that the zygotene cilium specifically connects 
to the bouquet centrosome, constituting a cable 
system of the cytoplasmic bouquet machinery. 
Furthermore, zygotene cilia extend throughout 
the germline cyst, a conserved germ cell orga- 
nization. We reasoned that the zygotene cil- 
ium could regulate bouquet formation and 
germline cyst cellular organization. 


RESULTS: We analyzed multiple viable loss-of- 
function mutations in zebrafish ciliary genes 
that in humans cause ciliopathies, genetic dis- 
orders that are caused by ciliary defects, in- 
cluding cep290, kif7, and cc2d2a. Using these 
mutants, as well as laser-induced excision of 
the zygotene cilium and live time-lapse imag- 
ing, we demonstrate that the organelle is 
essential for chromosomal bouquet and syn- 
aptonemal complex formation, germline cyst 
morphogenesis, ovarian development, and fer- 
tility. Mechanistically, we provide evidence 
that the cilium functions by anchoring the 
bouquet centrosome to counterbalance telo- 
mere rotation and pulling. We further show 
that the bouquet configuration is required for 
proper synaptonemal complex formation. Loss 
of the zygotene cilium resulted in bouquet and 
germline cyst perturbations that in turn in- 
duced oocyte apoptosis. Crossing the ciliary 
mutants to tp53 mutant fish demonstrated 
that they induce P53-dependent apoptosis, 
likely by activating a meiotic checkpoint. Ulti- 
mately, loss of the zygotene cilium leads to 
ovarian dysgenesis and deficient fertility in 
adult females. We also show that the zygotene 
cilium is conserved in both male and female 
meiosis in zebrafish, as well as in mammals. 


CONCLUSION: Our work uncovers the concept 
of the cilium as a critical player in meiosis and 
provides evidence that the bouquet is func- 
tionally required for synaptonemal complex 
formation. Our findings further shed new light 
on reproduction phenotypes in ciliopathies 
and propose a cellular paradigm that cilia can 
control chromosomal dynamics. 
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Control of meiotic chromosomal bouquet and germ 
cell morphogenesis by the zygotene cilium 
Avishag Mytlis'?}, Vineet Kumar}, Tao Qiu*+, Rachael Deis’, Neta Hart”, Karine Levy??, 
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Sudipto Roy*”*, Yaniv M. Elkouby"2* 


A hallmark of meiosis is chromosomal pairing, which requires telomere tethering and rotation on the 
nuclear envelope through microtubules, driving chromosome homology searches. Telomere pulling 
toward the centrosome forms the “zygotene chromosomal bouquet.” Here, we identified the “zygotene 
cilium” in oocytes. This cilium provides a cable system for the bouquet machinery and extends 
throughout the germline cyst. Using zebrafish mutants and live manipulations, we demonstrate that the 
cilium anchors the centrosome to counterbalance telomere pulling. The cilium is essential for bouquet 
and synaptonemal complex formation, oogenesis, ovarian development, and fertility. Thus, a cilium 
represents a conserved player in zebrafish and mouse meiosis, which sheds light on reproductive 
aspects in ciliopathies and suggests that cilia can control chromosomal dynamics. 


eiosis is a cellular program essential 

for producing haploid gametes, con- 

served in sexually reproducing or- 

ganisms. During prophase stages of 

meiosis, induced double-strand breaks 
must be repaired, and their repair through 
homologous recombination requires the pair- 
ing of homologous chromosomes and forma- 
tion of the synaptonemal complex (SC). The 
SC is composed of SC proteins Sycp1 to Sycp3 
that assemble between chromosomes and 
along their axes (J). Together with meiotic 
cohesins, these connect homologous chro- 
mosomes and hold them together for the 
execution of recombination (J, 2), and recent 
breakthroughs have revealed their structure 
and functions (/-8). 

However, the nuclear events of meiosis oc- 
cur in the cellular context of a differentiating 
gamete (9), where chromosomal pairing de- 
pends on cytoplasmic counterparts. In pair- 
ing, telomeres perform a unique function: At 
the leptotene stage, telomeres tether to Sun/ 
Kash complexes on the nuclear envelope (NE), 
which associate with perinuclear microtubules 
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(MTs) through dynein (10-17). This facilitates 
telomere rotations around the NE during the 
leptotene-zygotene stages (fig. SIA), which in 
turn shuffle chromosomes, driving their search 
for homologs. The perinuclear MTs at these 
stages emanate from the centrosome MT- 
organizing center (MTOC) (J6, 18, 19) (fig. 
S1A). Ultimately, rotating telomeres are pulled 
toward the centrosome and cluster on this side 
of the NE while looping their chromosomes 
to the other side, a configuration called the 
zygotene chromosomal bouquet (20, 21) (fig. 
S1A). These telomere dynamics are essential 
for synapsis, homologous recombination, and 
fertility (J0-17). Telomere clustering is thought 
to stabilize initial pairing between homolo- 
gous chromosomes, and in zebrafish, the 
bouquet was proposed as a hub for chromo- 
somal pairing (22). It was shown that double- 
strand breaks cluster at subtelomeric regions 
and that Sycp3 and Sycp! first load adjacent to 
clustered telomeres specifically at the bouquet 
stages, from which they extend along chromo- 
somal axes (22, 23) (fig. SIA). 

The bouquet configuration was discovered 
in 1900 (20) and is conserved from yeast to 
mammals (10-17). However, how the cytoplas- 
mic counterparts of bouquet formation are 
mechanically regulated in rotating telomeres 
is unknown. It is unclear how rotation forces 
are generated, counterbalanced to allow move- 
ments, and then halted at the correct stage. 

We previously characterized the nuclear and 
cytoplasmic components of the bouquet and 
its formation in zebrafish oogenesis (19, 24) 
(fig. SIB). To obtain a better temporal resolu- 
tion of bouquet formation, we performed live 
time-lapse analysis of bouquet chromosomal 
rotation in cultured ovaries (24) (fig. S2 and 
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movie S1). This analysis revealed rapid rota- 
tional movements, specifically in leptotene- 
zygotene oocytes, in contrast to oogonia and 
somatic pre-granulosa follicle cell nuclei, 
which do not execute meiotic chromosomal 
pairing (fig. S2 and movie S1). These rapid 
rotational movements suggest the underlying 
action of substantial mechanical forces. Rotat- 
ing telomeres then cluster, strictly juxtaposing 
the centrosome (18, 19, 23) (fig. SIA). Live 
recording of the centrosome during bouquet 
chromosomal rotations showed that it vib- 
rates in place, remaining fixed at the same 
position in the cytoplasm (movie $2). There- 
fore, as in mice (/8), telomeres rotate and are 
pulled by MTs toward the centrosome to clus- 
ter. These observations suggest that (i) MTs 
and the centrosome are subjected to mecha- 
nical strain during rotations, and (ii) the ce- 
ntrosome is likely to be physically anchored. 


The zygotene cilium 


We detected elongated fiber-like structures of 
acetylated tubulin that emanated from the 
centrosome in bouquet stage oocytes with 
clustered telomeres (J9). These structures 
were highly reminiscent of cilia. A cilium is 
an MT-based organelle composed of an axo- 
neme with stereotypically organized acety- 
lated microtubule doublets that grow from 
the mother centriole (basal body) and extend 
extracellularly (25, 26). 

Several lines of evidence demonstrated that 
the bouquet acetylated tubulin structures are 
cilia: (i) The acetylated tubulin (AcTub)- 
positive structures colocalized with the spe- 
cific ciliary membrane marker Arl13b (27, 28) 
(Fig. 1, A and C); Gi) tubulin glutamylation 
(GluTub), a posttranslational MT modification 
of ciliary axonemes, showed similar struc- 
tures (Fig. 1B) that colocalized with the AcTub 
signal (Fig. 1C); (iii) transgenic Arl13b-GFP in 
zygotene oocytes confirmed consistent ciliary 
structures in live ovaries (fig. S3B); and (iv) 
transmission electron microscopy (TEM) of 
ovaries identified structures that correspond 
to ciliary axonemes with typical MT doublets 
that are found in spaces between cell mem- 
branes of adjacent zygotene oocytes (Fig. 
1D, bottom panels, and fig. S3A). Within oocytes, 
we identified the basal body, transition zone, 
and axoneme extending extracellularly through 
the ciliary pocket (Fig. 1D, top panels, and 
fig. S3A). Serial block-face scanning electron 
microscopy (SBF-SEM) confirmed axonemal 
structures that emanated from basal bodies 
in prophase oocytes (movies S3 to S5). 

The cilia were stage specific: They were 
specifically present at the leptotene-zygotene 
stages as determined by our previously estab- 
lished staging criteria (19, 24) (fig. SIB). This 
was confirmed by the prophase SC marker 
Sycp3 (Fig. 1E and fig. S1B). Cilia were absent 
from most premeiotic oogonial stages, but a 
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Fig. 1. A zygotene-specific oocyte cilium. (A) AcTu 
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zygotene bouquet cyst from SBF-SEM data, showing: cytoplasm of cyst oocytes 
(different transparent colors), nuclei (gray), cilia (maroon; maroon-gray 
arrowhead), and basal body centrosomes (green). Bottom panel: same cyst 
render stripped from cytoplasm to show the cilia. Scale bars are 5 um. (H to 
J) Zygotene bouquet oocytes from ovaries labeled for (Scale bars are 5 wm): (H) 
Telomeres (Telo-FISH) and AcTub showing that the cilium emanates from the 
cytoplasm apposing the telomere cluster. n = 3 ovaries. A snapshot from 3D 
construction in movie S7. (I) yTub and AcTub showing that the cilium emanates 
from the centrosome, which was shown to localize apposing the telomere cluster (10). 
n= 5 ovaries. (J) Microtubules [7g(Bact:EMTB-3XGFP)] showing perinuclear cables 


few short cilia were detected at the leptotene- 
zygotene stage. Cilia were then fully elaborated 
at the zygotene bouquet and disassembled at 
the pachytene stage (Fig. 1, A, B, and F). This 
developmental pattern was consistently de- 
tected using AcTub, Arl13b, or GluTub label- 
ing (Fig. 1, A, B, and F). We conclude that we 
have identified a previously unknown cilium 
in early prophase oocytes that is specific to the 
leptotene-zygotene and zygotene bouquet stages. 

Early prophase, including the leptotene and 
zygotene stages, is executed while oocytes de- 
velop within the germline cyst (9, 19, 24, 29). 
The cyst is a compact and confined cellular 
organization in which cytoplasmic membranes 
of adjacent oocytes interface tightly (19). Zygo- 
tene cilia extended throughout the volume of 
the cyst (fig. S3, B to D). We characterized cil- 
iary distribution in the cyst using SBF-SEM, 
which provided ultrastructural data of cysts at 
EM resolution in three dimensions (3D; movie 
83). Tracking zygotene cilia in raw SBF-SEM 
data (movies S4 and S5) and in rendered lep- 
totene and zygotene cysts in 3D (Fig. 1H and 
movies S3 and S6) confirmed that these orga- 
nelles tangle tightly between oocyte cytoplas- 
mic membranes and extend throughout the 
cyst (Fig. 1G). Measurements of zygotene cilia 
diameters showed the expected diameter of cil- 
iary axonemes (~300 nm) (30) (fig. S5, A and B). 

In the germline cyst, cytoplasmic bridges 
(CBs) are also based on MT cables (79, 31). 
Several lines of evidence allowed us to dis- 
tinguish cilia and CBs in our analysis: (i) Cil- 
iary cytoplasm (TEM and SBF-SEM in Fig. 1D, 
fig. S5A, and movies S4 and S5) but not CB 
cytoplasm (fig. S4A) showed axonemal MT 
tracks and CB membranes were typically 
ruffled (fig. S4A); (ii) CB diameters (fig. S5, A 
and B) were consistent with those in Xenopus 
and mice [0.5 to 1 um (32)] but not with the 
ciliary diameter; (iii) CBs were adjacent to the 
centrosomes/basal bodies (19) but never asso- 
ciated with them; and (iv) cilia did not extend 
through or colocalize with CBs as labeled by 
the midbody marker Cep55 (33) (figs. S4B and 
S5, C and D). Thus, zygotene cilia and CBs are 
distinct and separate structures. 

We confirmed the organization of the zygo- 
tene cilium with the bouquet cytoplasmic 
machinery. Labeling bouquet-clustered telo- 
meres, perinuclear MT, centrosome, and the 


Mytlis et al., Science 376, eabh3104 (2022) 


zygotene cilia (Fig. 1, H to K, fig. S3E, and 
movies S7 to S9) confirmed the cytoskeletal 
arrangement of a cable system that extends 
from the zygotene cilium through the centro- 
some and MT to telomeres on the nuclear 
envelope. This cable system comprises a po- 
tential machinery for chromosomal pairing 
movements (Fig. 1L). Live time-lapse imag- 
ing confirmed that the zygotene cilium was 
present during bouquet chromosomal rota- 
tions (movie S10). 


The zygotene cilium is required for telomere 
clustering in chromosomal bouquet formation 


We next aimed to investigate the function of 
the zygotene cilium in oogenesis. Severe dis- 
ruption to cilia differentiation causes embry- 
onic lethality in zebrafish. However, mutations 
in certain cilia genes allow viability to the 
adult stage. We analyzed multiple such loss- 
of-function ciliary mutants that in humans 
cause ciliopathies, genetic disorders that are 
caused by ciliary defects (9), including cep290 
(34-40), kif7 (41-44), and cc2d2a (45, 46). 

We analyzed zygotene cilia in juvenile 
ovaries of wild-type (wt) as well as cep2907", 
kif and cc2d2a~ mutants (Fig. 2, A to D). 
On the basis of these measurements in 3D 
(Fig. 2A and fig. S6), we defined “normal cilia” 
(4 to 8.5 um), “short cilia” (<4 um) and absent 
cilia [not detected (N.D.)]. We analyzed the 
individual ciliary length (Fig. 2A), average cil- 
iary length per cyst (Fig. 2B), distribution of 
scored cysts (Fig. 2C), and scoring of entire 
ovaries (Fig. 2D). These established the short- 
ening and loss of zygotene cilia in cep290' ~ 
and cc2d2a’~ ovaries. Kif7' /- ovaries showed 
only mild partial ciliary shortening pheno- 
types (Fig. 2, A to D, and fig. S6). Attempting to 
abolish cilia altogether, we generated cep290; 
kif7 double mutants. Indeed, we observed that 
in cep290! “;kif7" - and cep290! ~skif7! ~ Ova- 
ries, cilia were mostly or completely abolished 
(Fig. 2, A to D, and fig. S6). 

We next analyzed bouquet formation upon 
ciliary loss in developing ovaries from juve- 
nile wt and mutant fish. We first noticed fewer 
oocytes with clear telomere clusters in cep290"' = 
ovaries compared with wt, suggesting that 
prophase progression is defective. To distin- 
guish between a delay versus failure in bouquet 
formation we investigated whether (i) oocytes 
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(nucleus is counterstained with DAPI). n = 7 ovaries. A snapshot from 3D 
construction in movie S8. Another example is shown in movie S9. (K) Co-labeling of 
microtubules (EMTB-GFP), yTub, AcTub, and DAPI, reveals that the zygotene cilium 
emanate from the centrosome, which is embedded in a perinuclear microtubule 
network. A merged image and all individual channels are shown. n = 5 ovaries. 
yTub (mlgG1) and AcTub (mlgG2b) antibodies are both isoforms of mlgG. We used 
mlgG secondary antibody that binds both primary yTub and AcTuB antibodies, 

and a mlgG2b secondary antibody that specifically binds to the AcTuB antibody and 
not yTub antibody (arrowheads in the three bottom panels). (L) Scheme showing 
the bouquet cytoplasmic cable system. 


at the leptotene-zygotene stages were delayed 
in bouquet formation, (ii) oocytes at bouquet 
stages formed proper telomere clusters, and 
Gii) late-bouquet oocytes eventually formed 
telomere clusters. 

We addressed a delay in the leptotene- 
zygotene stages. We tested whether oocytes 
that showed radial telomere distribution on 
the NE typical of the leptotene-zygotene stages 
were abnormally larger as an indication for 
prophase progression delay. wt leptotene- 
zygotene oocytes are 7 to 9 um in diameter 
and zygotene bouquet oocytes are 10 to 16 um 
in diameter (Fig. 2, E and F, and fig. S1B) 
(19, 24). In mutants, we identified oversized 
“Jeptotene-zygotene-like” oocytes, defined as 
oocytes with leptotene-zygotene-like radial 
telomere distribution that were larger than 
the largest detected wt oocyte at these stages 
(>9.1 um; Fig. 2, E and F, and fig. S7B). Oocyte 
sizes were based on cytoplasmic staining by 
DiOC6 (79, 24). In single and double kif7/ 
cep290 heterozygous, kif7™ /- and cep290"! - 
kif7 is ovaries, ~40 to 65% of oocytes with ra- 
dial telomeres were mildly larger leptotene- 
zygotene-like oocytes (Fig. 2, E and F, and 
fig. S7A). In cep290' ~ ovaries, 85% were much 
larger leptotene-zygotene-like oocytes (Fig. 2, 
E and F, and fig. S7A). In cep2907”;kif7/~ and 
cep290"' “;kif7" I ovaries, most were severely 
affected and contained low numbers of oocytes 
for analysis, and many ovaries of these geno- 
types converted to testes. In zebrafish, ovary 
conversion to testis is indicative of severe 
oocyte damage and loss (47-49), which we 
address and discuss below and in Fig. 5B 
and fig. $12. Nevertheless, we did capture 
ovaries for analysis. In cep290! ~;kif7! ~ and 
cep290! “;kif7" / “ovaries, 100% of oocytes with 
radial telomeres were very large leptotene- 
zygotene-like oocytes (Fig. 2, E and F, and fig. 
S7A). Similar large leptotene-zygotene-like 
oocytes were detected in cc2d2a"'~ ovaries 
(fig. S7B). This suggests that oocytes continued 
to grow in size, but bouquet formation was at 
least delayed. Such delay in progression from 
leptotene-zygotene to zygotene bouquet stages 
is consistent with the dynamics of meiotic 
checkpoint activation in females, which is diff- 
erent from males (9, 50, 51). 

We tested whether bouquet stage oocytes 
properly formed telomere clusters. We blindly 
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Fig. 2. The zygotene cilium is required for bouquet formation. (A and B) 3D 
measurements of ciliary lengths and average ciliary length per cyst. n, number 

of cilia. ND, not detected. Bars are mean + SD. (C) Distribution of cysts with 
given ciliary length or absence. (D) Images of cysts from ciliary (AcTub) length 
categories as in (B) to (D). Percentage of ovaries with indicated categories are 
plotted. n, number of ovaries. Scale bars, 10 um. (E) Images of “leptotene-zygotene- 


analyzed oocytes in sizes that were typical to 
midbouquet stages showing clear telomere 
clusters in wt (11.5 to 13 um). Telomere dis- 
tribution in wt midbouquet stages showed 
either a tight or slightly expanded telomere 
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cluster (Fig. 2G). Ovaries of heterozygous, 
kif7' /- and cep290"! ~kif7! ~ fish were mostly 
similar to wt (Fig. 2G). However, in cep2907 7 
ovaries, in addition to the wt categories, we 
detected an abnormal “dispersed” category, in 
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largest wt leptotene oocytes. (G) Left panel: images of mid-bouquet-stage oocytes 
with tight, expanded, and dispersed telomeres (categories shown in top right panel). 
ND, nondetected. Scale bars, 5 um. Right bottom panel: the percentage of each 
category in all genotypes. n, number of oocytes. 


which telomeres were largely expanded be- 
yond the nuclear equator (Fig. 2G). More- 
over, in double-mutant cep290! “kif7" /- and 
cep290';kif7’~ ovaries, we only detected “ex- 
panded” and “dispersed” category oocytes 
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Fig. 3. Centrosome anchoring by the zygotene cilium is required for 
telomere clustering in bouquet formation and for proper synaptonemal 
complex formation. (A) Live time-lapse imaging of chromosomal bouquet 
rotations in Tg(h2a:h2a-gfp):cep290°”* and sibling Tg(h2a:h2a-gfp);cep290’- 
ovaries. Chromosomal tracks and their sum at 7; are shown. Individual track 
velocities (n, number of tracks) and average track velocity per cell (n, number of 
cells) are plotted. Oocytes with mean and low outlier (below the SD) values of 
average track velocities are shown for cep290°’-. Images are snapshots from 
movie S12. Scale bars, 5 wm. (B) Laser ablation of the zygotene cilium. The 
targeted cilium (teal arrowhead) is shown “pre-ablation” and the ablation region 


Mytlis et al., Science 376, eabh3104 (2022) 17 June 2022 


at its base (S1:1) is indicated throughout the time-lapse. The ablated ciliary 
associated centrosome (yellow circle) dislocates upon ablation in both XY and Z 
(“centrosome out of focus”) axes (see cartoon, right). Time is indicated in 
mili-seconds. Scale bar, 5 um. n = 17 cilia (oocytes) from n = 11 ovaries. Images 
are snapshots from movie S13; another example is shown in movie S14. 

(C) Images of mid-bouquet stage oocytes from wt and cep290~;kif7'’~ ovaries 
co-labeled for the centrosome (yTub) and telomeres (Telo-FISH). The distance 
between the centrosome and the nucleus (white arrow) in the wt is significantly 
decreased in mutant oocytes with defected telomere clustering (yellow arrowheads); 
see cartoons below. The normalized distances are plotted and bouquet phenotype 
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categories are color-coded. Scale bar, 5 um.n, number of oocytes. (D) The normalized 
centrosome-nucleus distance in oocytes pooled from all genotypes in (C), categorized 
based on their bouquet phenotype. n, number of oocytes. (E) Images of Sycp3 
localization patterns in early and late zygotene stages. Note normal Sycp3 patterns 
in wt oocytes (yellow arrowheads) and their absence in mutant oocytes (scattered 


(Fig. 2G). Thus, telomere clustering is defec- 
tive in bouquet-stage mutant oocytes. Mid- 
bouquet-stage oocytes in cc2d2a~'~ ovaries 
also exhibited consistent telomere-clustering 
defects (fig. S7C). 

We tested whether telomere clustering is 
delayed in mutant ovaries. We repeated the 
above analysis on oocytes at sizes of 13 to 15 um 
in diameter, where the wt telomere clusters 
are “expanded” and “dispersed” (fig. $7, D and 
E) toward bouquet dissociation (19, 24). Het- 
erozygous cep290 and kif7, as well as cep29 pe Be 
kif7" /- and kif7 /- ovaries, were similar to the 
wt (fig. S7D). In cep290" ~ ovaries, telomeres 
were “expanded” and most were “dispersed,” 
and in cep2907;kif7'/- and cep290”;kif7 
ovaries all oocytes were “dispersed” (fig. S7, 
D and E). “Tight” telomere clusters were not 
detected at these stages, ruling out delayed 
clustering. Tight telomere clusters were sim- 
ilarly not detected in late-bouquet-stage oocytes 
in cc2d2a ovaries (fig. S7E). Thus, with loss 
of cilia, telomeres do not cluster properly, re- 
main distributed on the nuclear envelope, and 
fail to form the bouquet. 

These ciliary loss-associated defects are 
similar to bouquet disruption by dynein in- 
hibition (10-17). We treated ovaries with the 
specific dynein inhibitor ciliobrevin and per- 
formed the same bouquet analysis as above 
(fig. S8A). In contrast to control dimethyl sul- 
foxide (DMSO)-treated ovaries, dynein inhibi- 
tion resulted in a dose-dependent increase in 
oocytes of the “expanded” and “dispersed” cat- 
egories and a decrease in those of the “tight” 
category (fig. S8A). Thus, the loss of zygotene 
cilia phenocopied dynein inhibition in bou- 
quet formation. 

We next determined whether these bouquet 
phenotypes arose directly from ciliary de- 
fects. Cep290 localizations include the ciliary 
basal body and transition zone, as well as the 
centrosome (36, 40, 52-56). Cep290 is essential 
for cilia biogenesis (35-40), but whether it is 
required for centrosome MTOC functions in 
oocytes is unknown. The final step in MTOC 
maturation is yTub recruitment to the centro- 
some (57, 58). In cep2907 ~ oogonia and early- 
prophase oocytes, yTub recruitment to the 
centrosome was intact even in the most severe 
cases, where cilia were completely absent (fig. 
S9B). Further, the MT perinuclear organiza- 
tion in cep290! ~ early-prophase oocytes was 
not disrupted (fig. S8C). This demonstrates that 
the centrosome MTOC functions are unaltered, 
confirming that the observed bouquet defects 
arose directly from the loss of cilia. 
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Ciliary anchoring of the bouquet centrosome 
is required for telomere clustering and proper 
SC formation 

We hypothesized that the zygotene cilium 
could generate and transmit forces for telo- 
mere rotations or provide counterbalance for 
their movements by anchoring the bouquet 
centrosome. Force generation by the cilium 
would require dynein and kinesin motor ac- 
tivity along the ciliary axoneme and could in- 
volve cilia motility. We thus addressed the 
motility of the zygotene cilium. Four lines of 
evidence suggest that the zygotene cilium is 
not a motile cilium: (i) Ultrastructure anal- 
ysis of the zygotene cilium showed a 9+0 axo- 
neme organization without obvious dynein 
arms (Fig. 1D), which is typical for nonmotile 
cilia (59, 60); (ii) live time-lapse imaging did 
not reveal ciliary-beating movement, which 
is a characteristic of motile cilia (movies S10 
and S11); (iii) reporter line expression of motile 
ciliary master transcriptional regulators of the 
Foxjl family, Tg(foxjla:GFP) and Tg(foxjlb:GFP) 
(61, 62), was not detected in prophase oocytes 
within germline cysts where the zygotene cilium 
forms (fig. S9); and (iv) motility of zygotene cilia 
was not essential for fertility. Ccdcl03 is a 
motile cilium dynein assembly factor essential 
for motility (63). cedc103""~ adult females, res- 
cued from embryonic lethality through mRNA 
injection, were fertile (five of six females mated 
spontaneously and each produced 50 to 300 eggs 
with a 50 to 100% fertilization rate). Taken to- 
gether, these experiments show that the zygo- 
tene cilium is not a motile cilium. 

To investigate whether the zygotene cilium 
is required to generate chromosomal move- 
ment, we performed live time-lapse imag- 
ing of chromosomal rotations in cep2907! a 
ovaries [T; 19(h2a:H2A-GFP);cep2907! ~ fish; Fig. 
3A and movie S12]. In cep290' ~ oocytes, chro- 
mosomal rotations were still present and we 
only detected a very small decrease in track 
velocities (Fig. 3A and movie S12) compared 
with wt. Twenty percent of cep290! ~ oocytes, 
including 13.6% outliers, were plotted lower 
than the lowest wt oocyte (Fig. 3A and movie 
S12). However, these cep290' ~ oocytes still ex- 
hibited slower but clear rotations (Fig. 3A, left 
bottom panels, and movie S12). The persistence 
of gross chromosomal rotations in cep290" 7 
oocytes allows us to conclude that the zygotene 
cilium is not an essential generator of chromo- 
somal rotation forces. 

We next addressed a potential role for ciliary 
anchoring of the bouquet centrosome. The 
centrosome is stationary during chromosomal 
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foci, magenta arrowheads). Sycp3 signal accumulates in nucleoli (empty red circle) in 
all wt and mutant oocytes. Scale bars, 5 um. (F) The distribution of normal and 
abnormal Sycp3 oocytes from (E). n, number of oocytes. (G) Images of “leptotene- 
like” oocytes shown in scale. Scale bars, 5 um. (H) “Leptotene -like” oocyte sizes per 
genotype. n, number of oocytes. Bars are mean + SD. 


rotations in zebrafish (movie S2) and mice (/8) 
while telomeres are being pulled on MT cables 
that are associated with it. This suggests that 
the centrosome is under strain. It is plausible 
that the centrosome end of the cables must 
be affixed for the telomeres to be pulled and 
clustered toward the centrosome and not 
vice versa. In this case, without such counter- 
balance, telomeres will fail to cluster and form 
the bouquet. We hypothesized that the zygo- 
tene cilium could function as a physical an- 
chor required for counterbalancing telomere 
rotation forces and for telomere pulling and 
clustering. 

We tested whether ciliary excision would 
result in dislocation of the centrosome. To 
experimentally eliminate potential ciliary 
anchoring, we performed laser excision ex- 
periments in ovaries of double-transgenic fish 
fluorescently co-labeled for cilia and centro- 
somes [72(Bact:Cetn2-GFP);Te(Bact:Arl13b-GFP); 
Fig. 3B]. We excised the ciliary base in zygotene- 
bouquet-stage oocytes while tracing the centro- 
some by live time-lapse imaging. These analyses 
showed that upon ciliary excision, the associated 
centrosome immediately dislocated away (Fig. 
3B, yellow circle, and movies S13 and S14, red 
circles). By contrast, other centrosomes in the 
same cyst and frame that were associated with 
non-ablated cilia remained relatively station- 
ary (movies S13 and S14, green circles). This 
supports the specific effect of precise ciliary 
excision of the targeted centrosome achieved 
in this experimental setting. Moreover, control 
experiments with twofold-higher laser power 
immediately resulted in clear and stereotypic 
cell death (movies S15 and S16). This phenom- 
enon was clearly distinct from the localized 
centrosome dislocation observed, arguing against 
nonspecific consequences of laser treatment 
on centrosomal behavior. Thus, when discon- 
nected from the cilium, the centrosome does 
not remain stationary but dislocates. Centro- 
some dislocation is likely caused by the rota- 
tional forces of hauled telomeres that act on its 
associated microtubules. 

We addressed genetically the bouquet cen- 
trosome dynamics upon ciliary loss. We found 
that in the conditions were telomeres were dis- 
persed upon ciliary loss in cep2907”, cep2907- 
kif7'-, and cep290"'-;kif7~, the centrosome 
was much more juxtaposed to the nucleus 
compared with wt (Fig. 3C). We labeled ova- 
ries for the centrosome (yTub) and telomeres 
[Telo-fluorescence in situ hybridization (Telo- 
FISH)] and measured the distance between 
the centrosome and the nucleus in the mid- 
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Fig. 4. The zygotene cilium is required for germline cyst morphogenesis. 
(A, D, and E) Images of cyst (white outlines) phenotype categories labeled for 
the cytoplasmic marker DiOC6 and AcTub (A), showing gaps between oocytes 
(magenta arrows), elongated cytoplasmic bridges (yellow arrows) and isolated 
oocytes (white arrows). Scale bars, 10 um. The percen 
categories and number of isolated oocytes are shown i 
and (E) (n, number of ovaries), respectively. (B) Images of 3D cyst morphology 


zygotene-bouquet stage oocytes normalized to 
oocyte size (arrow in Fig. 3C). These normal- 
ized distances were significantly decreased 
in cep290", cep2907 ;kif7''-, and cep290'; 
kif7! ~ oocytes compared with wt (Fig. 3C). In 
parallel, we independently scored telomere clus- 
tering in these oocytes. The shortest centrosome- 
nucleus distances were measured in oocytes 
with dispersed telomeres (telomere clustering 
categories are color coded in Fig. 3C). We re- 
ciprocally tested the centrosome-nucleus dis- 
tance per telomere clustering category in pooled 
data from all genotypes above (Fig. 3D). We 
found that the centrosome-nucleus distance 
associated with telomere-clustering capability. 
Sufficient distance in “tight” correlated with 
successful clustering, whereas oocytes with 
“dispersed” failed clustering always showed 
the shortest distances (Fig. 3D). This further 
substantiates the view that a counterbalance 
between the centrosome and rotating telo- 
meres is an inherently required feature of 
bouquet formation. Together, these results 
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suggest that the zygotene cilium is required 
for anchoring the centrosome as a counter- 
balance for telomere rotations. When cilia are 
lost, the centrosome is instead pulled toward 
the nucleus, telomere pooling is less efficient, 
and bouquet formation fails. 

We finally tested whether bouquet failure 
affected the formation of SCs between pairing 
chromosomes. It has been shown in zebrafish 
that the bouquet telomere cluster is a hub for 
chromosomal pairing (22). Double-strand breaks 
cluster in subtelomeric regions, and the load- 
ing of SC protein 3 (Sycp3) is initially seeded 
in these regions specifically at bouquet stages 
(22, 49). Sycp3 then spreads from this posi- 
tion along the axes of pairing chromosomes 
(22, 49). This suggests that telomere clustering 
in the bouquet configuration could be func- 
tionally required for SC formation. We analyzed 
Sycp3 localization patterns in whole-mount 
wt and mutant ovaries in the early zygotene- 
bouquet-stage (oocyte size 10 to 12 um) and 


analysis per category. Normalized distances are plotted. n = 5 to 7 cysts from 
n= 2 to 3 ovaries per genotype. Bars are mean + SD. (C) Vasa and AcTub labeling 
shows normal cysts in wt ovaries, and disintegrated cysts (elongated CBs, 
yellow arrows) as well as isolated zygotene oocytes (white arrows) in cep290°’~ 


tion is plotted in fig. S11. Scale bars, 10 um. (F) Severity 
nad conversions plotted for three categories of ciliary 


defects pooled from all genotypes. n, number of gonads. 


12 to 14 um) as defined in fig. $1, B and C. All 
wt and kif7! ~ zygotene oocytes showed nor- 
mal Sycp3 patterns, including initial polarized 
loading of Sycp3 at the early zygotene-bouquet 
stage (22, 49), and the beginning of its ex- 
tension along chromosome axes at the late 
zygotene-bouquet stage (22, 49) (Fig. 3E, yellow 
arrowheads). By contrast, in most zygotene 
stage oocytes of cep290! a cep290' skif7" / ay 
and cep290" “kafz! ~ ovaries, these normal Sycp3 
patterns were completely lost, with only few 
scattered foci remaining in some oocytes (Fig. 
3, Eand F, magenta arrowheads). All zygotene 
oocytes from all genotypes showed accumula- 
tion of Sycp3 signal in oocyte nucleoli (Fig. 3E, 
red empty circles). These results are consistent 
with the corresponding bouquet defects in 
these mutants (Fig. 2G). 

Sycp3 labeling also confirmed the prophase 
delay observed in our Telo-FISH analyses (Fig. 2, 
E and F). Measuring the diameter of all oocytes 
exhibiting the Sycp3 oval patch pattern that is 


the late zygotene-bouquet-stage (oocyte size 
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characteristic of the leptotene stage (23) (fig. 
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S1C), confirmed their typical sizes in wt and 
kif7 I ovaries, and that most of these oocytes 
in cep290"~, cep290 ;kif7""”, and cep290"; 
kif7! ~ ovaries were oversized (Fig. 3, G and H). 
These results confirm defects in prophase pro- 
gression, where defected oocytes continue to 
grow while delayed in prophase, as discussed 
above. 

Altogether, these findings demonstrate that 
upon ciliary loss, failed telomere clustering re- 
sulted in failed Sycp3 loading to and along chro- 
mosome axes. Thus, the zygotene cilium is 
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required for proper formation of SC, at least 
partly through bouquet centrosome anchoring. 


Loss of the zygotene cilium perturbs germline 
cyst morphogenesis 


We identified a second phenotype in the ciliary 
mutant ovaries in which germline cysts were 
disintegrated. The wt cysts are compact, with 
no gaps between cytoplasmic membranes of 
adjacent oocytes (19), as detected by DiOC6 
cytoplasmic staining (“normal” in Fig. 4A). 
By contrast, in cep290! ~ and cep290"' “skif7" I 
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Fig. 5. Ciliary mutants show defective ovarian development and fertility. 
(A) cCaspase3 apoptosis labeling (white arrowheads) in juvenile ovaries. DiOC6 
is a cytoplasmic marker. Scale bars are 50 wm. The number of cCaspase3- 
positive oocytes per gonad are plotted. Each dot represents a gonad, n = 5 to 
10 gonads per genotype. Bars are mean + SD. (B) Rates of 
conversion. n, number of gonads. (C) Images of eggs obtained by squeezing for 
IVF experiments. Scale bar, 1 mm. (D) Adult females and ovaries showing: 
ovaries within the peritoneal cavity (black outline), st.lll premature oocytes 
(green arrows and outlines), young transparent st. oocytes (blue arrows), st.ll 
oocytes (pink arrows) and degenerated tissue masses (red arrow and outline). 
Ruler graduations and scale bars, 1 mm. Mutant females exhibit scoliosis. 


juvenile gonad 


ovaries, we observed abnormal cyst integrity 
(Fig. 4, A and D). We categorized cyst disin- 
tegration on the basis of severity: “Loose cysts” 
showed abnormal gaps between oocytes, where- 
as more severe “disintegrated cysts” showed 
larger gaps and exposed elongated CBs. In 
the compact wt cysts, CBs cannot be resolved 
without a specific marker. However, CBs were 
obvious in the increased spaces between 
oocytes in disintegrated cysts of mutants. To 
quantify cyst disintegration, we measured the 
normalized distance between all neighboring 


8 of 17 


RESEARCH | RESEARCH ARTICLE 


nuclei in 3D in wt, cep2907! ~, and cep290! oR 
kif7" Ir cysts (Fig. 4B, right panel, and movie 
S17). The normalized distances between neigh- 
boring nuclei in cep290"", and cep2907;kif7"!- 
cysts were significantly increased compared 
with wt (Fig. 4B). A similar cyst integrity pheno- 
type with greater normalized distances between 
cyst nuclei was observed in cc2d2a mutant ova- 
ries (fig. S10, A and B). 

We observed a third and most severe cate- 
gory of “isolated oocytes” in cep290/ ~ and 
cep2907” ;kif7""” ovaries (Fig. 4, A and E). In wt 
ovaries, zygotene-bouquet-stage oocytes are 
never seen outside the germline cyst. Isolated 
oocytes are zygotene-bouquet-stage oocytes 
that were found individually scattered in 
cep2907~ and cep2907;kif7'!~ ovaries (Fig. 4:4), 
indicating their disintegration from cysts. 
Telomere labeling (Telo-FISH) confirmed 
that “isolated oocytes” are abnormal zygo- 
tene bouquet oocytes (fig. S11A). In contrast to 
wt, heterozygous and kif7! ~ ovaries, ovaries of 
cep290' 3 cc2d2a7! ~, and cep2907 ~kif7"! ~ fish 
barely contained normal cysts but instead har- 
bored mostly disintegrated cysts and isolated 
oocytes (Fig. 4D and fig. SIOA). The cep290! Ss 
kif7! ~ gonads converted to testes in these ex- 
periments, preventing oocyte analysis, but in- 
dicating oocyte loss caused by severe defects 
(see discussion below and Fig. 5 and fig. S12). 
Labeling ovaries with the germ-cell-specific 
marker Vasa confirmed the detection of iso- 
lated zygotene oocytes in doublets and sin- 
glets, as well as disintegrated cysts with long 
cytoplasmic bridges extending between distant 
oocytes (Fig. 4C and fig. S11, B and C). These 
oocytes exhibited oval and irregular cellular 
morphology that coincided with their likely 
abnormal separation (Fig. 4C). Later pachytene- 
and diplotene-stage oocytes exhibited irreg- 
ular mesenchymal-like cellular morphology 
in cep290! ~ ovaries instead of their smooth 
spherical morphology in the wt (fig. S11C). 
Disintegration of cysts in these Vasa experi- 
ments was mostly apparent in cep290! ~ mu- 
tants, whereas cep2907 ~kif7"! ~ and cep2907 eG 
kif7! ~ were converted to testes (discussed be- 
low) (fig. S11B). These results show that the 
zygotene cilium is required for cyst integrity 
and morphogenesis. 

In all of the experiments above, we co- 
labeled cilia by AcTub staining and simul- 
taneously analyzed cyst integrity and ciliary 
phenotypes. We analyzed the correlation be- 
tween these phenotypes. We categorized gonads 
from all genotypes (n = 43 gonads, n = 10 to 
60 cysts per gonad) on the basis of their ciliary 
phenotype (as in Fig. 2, A to D) and separately 
their cyst integrity phenotypes (as in Fig. 4D). 
This revealed a strong correlation between 
the severity of ciliary loss and severity of cyst 
phenotypes (Fig. 4F and fig. S10C). This cor- 
relation is consistent with our findings that 
ciliary defects directly underlie bouquet pheno- 
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types (fig. S8B). The most severe phenotype 
with complete ciliary loss predicts conversion 
to testes, which represents oocyte loss in zebra- 
fish (discussed below), and was observed in 
~60% of gonads (Fig. 4F). We found similar 
correlation between ciliary loss and cyst pheno- 
types in cc2d2a ovaries (fig. S10C). 


Loss of the zygotene cilium is detrimental 
to oogenesis and fertility 


The above bouquet and germline cyst pheno- 
types represent severe oocyte defects. We tested 
their consequences on oogenesis and whether 
they result in oocyte loss by apoptosis. In wt 
juvenile ovaries, apoptotic oocytes are rare 
(64), as detected by cleaved Caspase3 in wt, 
cep290, and kif7 heterozygous fish (Fig. 5A). 
We detected more apoptotic oocytes in cep2907' z 
kif7’” and in cep290°' ;kif7"~ ovaries, and this 
was significantly increased in cep290" “skif7" I 
and cep290! “SRif7 ’- ovaries (Fig. 5A). These 
results show that ciliary loss results in severe 
oocyte defects and consequently oocyte loss 
by apoptosis. 

Substantial oocyte loss by apoptosis during 
ovarian development in zebrafish often results 
in gonad conversion to testis (9, 47-49). Con- 
sistent with the increased apoptosis levels, 
we found that in juvenile cep290! “;kif7" “and 
cep290' “SRtf7 I fish, most gonads converted 
to testes (Fig. 5B). In all of our analyses above, 
we examined oocyte phenotypes in normal 
and affected ovaries (fig. S12) across the re- 
ported genotypes before potential conver- 
sion. Gonads from these analyses that were 
converted and did not include oocytes were 
categorized separately as “converted to testis” 
(fig. S12) and are described here. In wt and 
heterozygous mutants, we detected up to ~5% 
conversion (Fig. 5B), which represents occa- 
sional earlier sex determination. By contrast, 
~24% of cep290' ~ ovaries were converted, 
whereas ~60% of cep290';kif7'/” and ~80% of 
cep2907! Shif7 ‘- ovaries were converted (Fig. 
5B). cep2907-;kif7"” and cep2907;kif7"~ ovar 
ries concomitantly exhibited the highest levels 
of apoptosis (Fig. 5A). We observed a male 
bias in adult cep290! - cep290! “;kif7* ! “,and 
cep290'";kif7'~ fish (fig. $13). Altogether, 
these results demonstrate that ciliary loss is 
detrimental to oogenesis. 

Ciliary loss could directly cause the above 
bouquet and germline cyst phenotypes, which 
in turn induce apoptosis. Alternatively, they could 
induce oocyte apoptosis, which in turn causes 
these phenotypes downstream. Throughout our 
analyses above, bouquet and cyst phenotypes 
were detected in oocytes that otherwise appeared 
viable, suggesting that they preceded apoptosis. 
To confirm this, we crossed cep290 fish to tp53 
mutants, which rescues apoptosis (47, 48, 50). 
We confirmed that loss of Tp53 completely res- 
cued oocytes from apoptosis in cep290! 3 
tp53! ~ ovaries (fig. S14A), as well as juvenile 
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ovary conversion to testes (fig. S14B) and the 
male bias in adults (fig. S14C). Double-mutant 
juvenile ovaries continued to develop and con- 
tained later-stage oocytes with an extremely 
abnormal morphology that is never detected 
in wt or cep2907! ~ ovaries (fig. S14D). This dem- 
onstrates that in cep290' ~3tp53! ~ ovaries, de- 
fective oocytes, which in cep290' ~ ovaries were 
otherwise cleared by apoptosis, survived but 
failed to develop normally. These results con- 
clude that the loss of the zygotene cilium is 
detrimental to oocyte development. The de- 
fects in bouquet and germline cyst morphol- 
ogy likely activate a meiotic checkpoint (which 
in turn induces tp53-dependent apoptosis). 

We reevaluated the ciliary bouquet and cyst 
phenotypes in cep290! ~tp53! ~ gonads. We 
reasoned that if the bouquet and cyst pheno- 
types were caused by oocyte apoptosis, then 
they should be rescued in cep2907! ~tp53! ~ 
ovaries. However, if they were caused by the loss 
of cilia, then they should persist in cep2907! 5 
p53! ~ ovaries. Bouquet phenotypes, including 
prophase delay and telomere-clustering de- 
fects, were consistently prominent in cep2907 7 
as well as in cep290! ~¢p53! ~ ovaries (fig. S15). 
These findings show that these bouquet pheno- 
types are caused independently of apoptosis. 
In tp53! ~ ovaries, we detected slightly over- 
sized “leptotene-zygotene-like” oocytes (fig. S15, 
A and B) and ~20% mid-bouquet oocytes with 
dispersed telomere clusters (fig. S15, C and D). 
These likely represent oocytes that are normally 
cleared by checkpoint-induced apoptosis in wt 
ovaries but are retained in p53! ~ ovaries. 

We next examined the cyst phenotypes after 
p53 knock-out. In both cep290" ~ and cep290" = 
p53! ~ ovaries, cysts were mostly loose or dis- 
integrated compared with wt (fig. S16, A and 
C), as confirmed by 3D cyst analysis (fig. S16, 
B and D). This argues that apoptosis is not a 
primary cause for cyst disintegration. Although 
we detected more normal cysts in cep2907! % 
p53! ~ ovaries than in cep2907! ~ ovaries, >60% 
of cysts were defective in these ovaries compared 
with none in the wt and ~2% in p53! ~, The above 
analyses argue that bouquet and cyst pheno- 
types are not direct consequences of oocyte 
apoptosis. We conclude that ciliary loss causes 
bouquet and germline cyst defects that induce 
apoptosis, which is detrimental to oogenesis. 

For an ultimate readout of oogenesis, we 
examined adult ovaries and female fertility. 
kif7 I females, in which ovaries showed nor- 
mal or only mildly shortened cilia (Fig. 2, A 
and B), were fully fertile. By contrast, we found 
that cep2907 ~ and cc2d2a’~ adult females did 
not mate spontaneously in either homozygous 
in-crosses or when homozygous females were 
crossed to wt males (fig. S17A). This indicates 
defective oogenesis, or abnormal spawning or 
mating behavior, which might be secondary to 
the characteristic cilia loss-scoliosis pheno- 
type in mutants (34, 45, 65, 66). However, 
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considering that rescued ccdc103 mutants 
are fertile (above) despite having profound 
scoliosis, this is an unlikely possibility. To 
specifically test for abnormalities in oogenesis, 
we performed in vitro fertilization (IVF) assays 
between homozygous mutant females and 
wt males. Most cep2907! ~ and cc2d2a’~ females 
produced degenerated eggs or no eggs at all 
(Fig. 5C and fig. S17A). Consistent with their 
deficient fertility, cep290" ~ and cc2d2a’~ fe- 
males exhibited profound ovarian dysgenesis 
(Fig. 5D and fig. S17C). Compared with wt, 
mutant ovaries were smaller, underdeveloped, 
and exhibited degenerated masses of tissue 
(Fig. 5D and fig. S17C). We were able to con- 
firm these phenotypes in a fifth ciliary mutant 
that is viable in zebrafish and causes human 
ciliopathies, armc9 (9, 65, 67, 68). armcg 
females exhibited similar fertility deficiency in 
both spontaneous mating and IVF experi- 
ments, as well as ovarian dysgenesis (fig. S17, 
A to C). These data show that the zygotene 
cilium is required directly in oogenesis and is 
essential for ovarian development and fertility. 


The zygotene cilium is conserved in male and 
female meiosis in zebrafish and mouse 


Considering the importance of zygotene cilia 
in oogenesis, we investigated whether it also 
forms during spermatogenesis. In juvenile testes, 
co-labeling of AcTub with the cytoplasmic 
membrane marker B-catenin and with the pro- 
phase marker Sycp3 detected cilia-like structures 
in prophase spermatocytes within developing 
seminiferous tubules (fig. S18A). Sperm flagella 
are specialized ciliary structures with similar 
acetylated tubulin-positive axonemes (69). 
Flagella formation begins after prophase, at 
midround spermatid stages, and completes 
by the mature spermatozoa stage (69). To dis- 
tinguish potential zygotene cilia from flagella, 
we analyzed seminiferous tubules in adult 
testes, where all spermatogenesis stages can 
be detected. We co-labeled AcTub with three 
prophase spermatocyte markers separately: 
telomere dynamics (Telo-FISH) (23), Sycp3 
(49), and yH2Ax (49) (Fig. 6, A to C). Round 
spermatids had AcTub-positive flagella [E2 
in Fig. 6, A to C; staging criteria in (9)], and 
flagella of spermatozoa were very long and 
swayed within the lumen of the tubules (E3 in 
Fig. 6, A to C). Zygotene-stage spermatocytes 
with the characteristic bouquet patterns of telo- 
meres (Telo-FISH), Sycp3, and yH2Ax (23, 49), 
showed clear AcTub-labeled cilia (Fig. 6, A to C). 
In pachytene spermatocytes and E1 early-round 
spermatid stages, some showed AcTub-positive 
cilia whereas others did not (Fig. 6, A and B). 
These stages likely represent an intermediate 
time window during and between zygotene cilia 
removal and flagella formation. TEM analysis 
confirmed ciliary structures in zygotene sperma- 
tocytes (Fig. 6D). These emanated from densely 
packed spermatocytes (Fig. 6D) that are distinct 
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from the flagella structures in the lumen of 
seminiferous tubules (69) (fig. SI8C). Thus, the 
zygotene cilium is a separate structure from 
the flagellum and occurs in the same meiotic 
stages in spermatogenesis as in oogenesis. 

Bouquet mechanisms of meiotic chromo- 
somal pairing are universal in sexually re- 
producing species (J0-18, 70-73). We thus 
investigated whether the zygotene cilium is 
conserved in mammals. In the mouse fetal 
ovary, oocyte development is synchronized 
(74-77). Sycp3 labeling showed that E14.5 ova- 
ries contained mostly early-prophase oocytes 
(74-77) (Fig. 6E). AcTub labeling of E14.5 ova- 
ries identified two structures in Vasa-positive 
germline cysts (Fig. 6F): One resembled a CB 
and the other a cilium. We confirmed the ciliary 
structures by co-labeling AcTub and Vasa with 
ciliary markers. yTub (Fig. 6G), Arl13b (Fig. 
6H), GluTub (Fig. 6D, and Adcy3 (Fig. 6J) all 
confirmed AcTub-positive ciliary structures 
adjacent to negatively stained AcTub-positive 
CBs. Co-labeling for Vasa, Sycp3, and GluTub 
confirmed that the cilia form in prophase 
oocytes and, similar to zebrafish, within the 
germline cyst (Fig. 6K and fig. S18D). We also 
identified ciliary structures in spermatocytes 
in postnatal day 12 and 16 mouse testes using 
multiple ciliary markers including Arl13b and 
GluTub co-labeled with Sycp3 (fig. S19). Adcy3 
further identified these ciliary structures and 
colocalized to the centrosome basal body as 
labeled with yTub (fig. S19A). These ciliary 
structures in mouse spermatocytes were very 
short and included mostly a shortly extended 
basal body (arrowheads in fig. S19, right pan- 
els). Highly similar atypical ciliary structures 
with yet unknown functions have been previ- 
ously reported in Drosophila spermatocytes 
(78). These results demonstrate that the zygo- 
tene cilium is conserved between sexes and 
across multiple organisms, suggesting that it 
could be an inherent and fundamental part 
of the meiosis program. 


Discussion 


Our work identifies the cilium as a conserved 
player in meiosis that regulates chromosomal 
bouquet formation and germ cell morphoge- 
nesis. The bouquet telomere dynamics facilitate 
chromosomal homology searches for pairing 
and was first described in 1900 (20). However, 
fundamental principles of its activity and regu- 
lation are still being uncovered (15, 16, 72, 79-81). 
Our work describes the complete framework 
of the cytoplasmic bouquet machinery as a ca- 
ble system assembled by the zygotene cilium, 
centrosome, and microtubules. This machinery 
biomechanically facilitates telomere dynamics 
for chromosomal bouquet formation in meiosis. 
We have shown that as part of this machinery, 
the zygotene cilium anchors the centrosome to 
counterbalance telomere rotation and pulling 
to form the bouquet. Further, we have shown 
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that bouquet telomere clustering is required for 
proper SC formation. In this machinery, the 
cilium could also contribute to meiotic progres- 
sion and cyst organization by transducing an as- 
yet unknown signal, which will require further 
investigation. 

Our findings suggest that telomere clus- 
tering in the bouquet is required for proper SC 
formation. Investigations in mice established 
that telomere rotations are essential for ho- 
mology searches (11-13, 15, 16). However, mu- 
tants in these studies prevented proper loading 
of telomeres onto the NE and/or their asso- 
ciation with MT, precluding the experimental 
uncoupling between telomere rotations and 
their clustering (77-13, 15, 16). We now show 
that upon ciliary loss, telomeres rotate on the 
NE but do not cluster. In these conditions, Sycp3 
fails to load on chromosomes and oocytes un- 
dergo P53-dependent apoptosis. It is plausible 
that in the bouquet, telomere clustering could 
contribute to Sycp3 seeding in assembling the 
SC. Recent yeast-based mathematical model- 
ing of meiotic chromosomal dynamics showed 
that pairing is faster if a bouquet configura- 
tion is added to the modeling simulation (82). 
In zebrafish, the bouquet telomere cluster was 
proposed as a hub for chromosomal pairing 
(22). DSB cluster in subtelomeric regions and 
Sycp3 and Sycp1 initially load adjacent to clus- 
tered telomeres, specifically at the bouquet stage 
(22, 49), and from this position, Sycp3 and Sycp1 
then extend along chromosomal axes (22, 49). 
Our work provides functional evidence to sup- 
port this mechanism. 

Our findings shed new light on reproduc- 
tion phenotypes in ciliopathies. These have so 
far been explained by sperm flagella defects 
and defective motile cilia lining the fallopian 
tubes and efferent ducts (83, 84). Our study 
reveals ciliary functions in early differentiat- 
ing gametes and in fetal ovaries. Severe pe- 
diatric ciliopathy syndromes are lethal or lead 
to acute developmental defects in young chil- 
dren before puberty (37-40, 85-89), likely 
masking reproduction phenotypes. In milder 
ciliopathies, phenotypes could be variable. In 
our analyses, truncated cilia produced milder 
phenotypes than complete ciliary loss. Fur- 
ther, cep290, cc2d2a, and armc9 females had 
deficient fertility, but some were able to pro- 
duce fertilizable eggs. This is consistent with 
reported pleiotropic and variable ciliopathy 
phenotypes in humans (37-40, 85, 86). Thus, 
our findings uncover a mechanism that could 
additionally contribute to reproduction phe- 
notypes in ciliopathies. 

Finally, our work offers the paradigm that 
ciliary structures can control nuclear chro- 
mosomal dynamics. Most cells in metazoans 
are ciliated and many exhibit cell-type-specific 
nuclear and/or chromosomal dynamics, imply- 
ing that ciliary regulation of nuclear events 
may be widely conserved. 
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Materials and Methods 

Ethics statement 

All animal experiments were supervised by 
the Hebrew University Authority for Biolog- 
ical Models, the Veterinéramt Ziirich, and the 
Singapore National Advisory Committee on 
Laboratory Animal Research according to the 
institutional animal care and use committee 
and accredited by AAALAC. All experiments 
were appropriately approved under ethics re- 
quests MD-2016222-1, MD-18-15600-2, ZH116/ 
2021- 33632, 191419, and 191429. 


Fish lines and gonad collections 


Juvenile ovaries were collected from 5 to 7 weeks 
postfertilization (wpf) juvenile fish. Fish had 
a standard length (SL) measured according 
to (90) and were consistently ~10 to 15 mm in 
length. Ovary collection was done as in (/8, 24). 
Briefly, to fix the ovaries for immunostaining 
and DNA-FISH, fish were cut along the ventral 
midline and the lateral body wall was removed. 
The head and tail were removed, and the trunk 
pieces with the exposed abdomen containing 
the ovaries were fixed in 4% paraformaldehyde 
(PFA) at 4°C overnight with nutation. Trunks 
were then washed in phosphate-buffered saline 
(PBS), and ovaries were finely dissected in cold 
PBS. Ovaries were washed in PBS and then 
either stored in PBS at 4°C in the dark or de- 
hydrated and stored in 100% MeOH at -20°C 
in the dark. Testes were collected similarly 
from 7-wpf juvenile fish and from adult fish 
and processed similarly. Adult ovaries were 
collected from wt and mutant fish using sim- 
ilar microdissection and imaged using a Leica 
S9i stereomicroscope and camera. 

Fish lines used in this research were: TU 
wt, cep290°9"(34), kif7?"(41), cc2d2a""® (45), 
tp53?“* (91), Tg(u-act:Arl13b-GFP) (92), 
Tg(h2a:H2A-GFP) (93), Tg(u-act:Cetn2- GFP 
(94), Tg(foxjla:GFP) (62), Tg(foaj1b:GFP) (61), 
Tg(u—act: mCherry-Cep55l),and armcg?> 
(68). The Tg(u-act: mCherry-Cep551) line was 
generated using the tol2 system. mCherry- 
cep55l was polymerase chain reaction ampli- 
fied from a mCherry-cep55l CI vector (95) 
and cloned into pDest-Tol2 plasmid under 
the f-actin promoter using Gibson assembly 
(E5510S; New England Biolabs). 


Staging of prophase oocytes 


Oocyte prophase stages were determined on 
the basis of established staging criteria (18, 24) 
(fig. SIB). We established prophase staging 
criteria for zebrafish oocytes on the basis of 
the characteristic telomere dynamics at each 
stage (18, 24). Briefly, telomeres are distrib- 
uted intranuclearly in mitotic oogonia, then 
load on the NE at the onset of meiosis and 
remain radially distributed on the NE through 
the leptotene-zygotene stages (including dur- 
ing chromosomal rotations). Telomeres are 
clustered at one pole of the nucleus during 
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zygotene bouquet stages. Later, they disperse 
radially and dissociate back into the nuclear 
vicinity (18, 24). We characterized cellular fea- 
tures, including the typical oocyte size and 
nuclear morphology, that strictly and distinc- 
tively accompanied each telomere distribution, 
such that they identify each stage even without 
telomere labeling (18, 24) (fig. SIB). We developed 
a reliable method for measuring oocyte size in 
diameter in 3D (24), which we routinely use for 
determining the oocyte size as part of its stag- 
ing criteria. For example, oogonia with dispersed 
intranuclear telomere distribution was always 
~10 um with noncondensed 4’,6-diamidino-2- 
phenylindole (DAPI)-labeled DNA and central 
one to three nucleoli, leptotene-zygotene stages 
with telomere radially loaded on the NE were 
always 7 to 9 um with central few nucleoli and 
zygotene bouquet with clustered telomeres on 
the NE were always 10 to 16 um with condensed 
DAPI-labeled chromosomes and a peripheral 
single nucleolus (18, 24) (fig. S1B). 

In this study, to further precisely determine 
the specific leptotene-zygotene stages of pro- 
phase oocytes, we used the established nuclear 
patterns of the synaptonemal complex protein 
Sycp3. We identified the characteristic patterns 
to each prophase stage as established in (22, 23). 
The leptotene stage is characterized by an elon- 
gated patch of Sycp3 signal in the oocyte nucleus 
(22, 23) (fig. SIC). Early zygotene is characterized 
by a polarized Sycp3 signal that corresponds to 
the bouquet telomere cluster, where it begins to 
load on chromosomes in zebrafish (22, 23) (fig. 
S1C). During late zygotene, the Sycp3 signal ex- 
tends from its polarized location along chro- 
mosomal axes (22, 23) (fig. SIC). We measured 
oocyte size per each Sycp3 pattern in three 
dimensions in wholemount ovaries, as described 
in (24). Figure SIC shows Sycp3 patterns and 
their corresponding oocyte sizes. Note that these 
Sycp3-based oocyte sizes are fully consistent with 
these based on Telo-FISH and morphological 
features above, further confirming our staging 
criteria. In all our analyses we always included 
cytoplasmic labeling (like DiOC6) or used the 
background of other labeling to visualize oocyte 
cytoplasm, measure oocyte size in three dimen- 
sions (18, 24), and determine its prophase stage 
based on the size and morphological criteria. 


Fluorescence immunohistochemistry (IHC) 
and DNA-Telo-FISH 


IHC was performed as in (18, 24). Briefly, ova- 
ries were washed twice for 5 min in PBT (0.3% 
Triton X-100 in 1x PBS; if stored in MeOH, 
ovaries were gradually rehydrated first), then 
washed four times for 20 min in PBT. Ovaries 
were blocked for 1.5 to 2 hours in blocking 
solution (10% fetal bovine serum in PBT) at 
room temperature and then incubated with 
primary antibodies in blocking solution at 4°C 
overnight. Ovaries were washed four times 
for 20 min in PBT and incubated with second- 
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ary antibodies in fresh blocking solution for 
1.75 hours; they were light protected from this 
step onward. Ovaries were washed four times 
for 20 min in PBT and then incubated in PBT 
containing DAPI (1:1000; Molecular Probes) 
with or without DiOC, (1:5000; Molecular Probes) 
for 50 min and washed twice for 5 min in PBT 
and twice for 5 min in PBS. All steps were 
performed with nutation. Ovaries were trans- 
ferred into Vectashield (with DAPI; Vector 
Laboratories). Ovaries were finally mounted 
between two #1.5 coverslips using a 120-um 
spacer (Molecular Probes). 

Primary antibodies used were y-tubulin 
(1:400; Sigma-Aldrich), green fluorescent pro- 
tein (GFP; 1:400; Molecular Probes), Vasa 
(1:5000) (96), acetylated tubulin (1:200; Sigma- 
Aldrich), B-catenin (1:1000; Sigma-Aldrich), 
Arl13b (97), glutamylated tubulin (1:400; 
Adipogen), Sycp3 (1:200; Abcam), ywH2Ax (1:400; 
GeneTex), and cCaspase3 (1:300; Abcam). Sec- 
ondary antibodies were used at 1:500 (Alexa 
Fluor; Molecular Probes). 

Telo-FISH was performed using the PNA 
technique (PNA-Bio) following the manufac- 
turer’s protocol. Hybridization buffer was 70% 
formamide, 1 mM Tris, pH 7.2, 8.5% MgCl, 
buffer (25 mM magnesium chloride, 9 mM 
citric acid, and 82 mM sodium hydrogen phos- 
phate, pH 7), 1x blocking reagent in maleic 
acid buffer (100 mM maleic acid, pH 7.5), 0.1% 
Tween 20, and 88 nM probe (5 -CCCTAACCC- 
TAACCCTAA-3 ,Cy3-conjugated). 

For a combination of IHC with Telo-FISH, 
IHC was performed first. At the end of the IHC 
procedure, ovaries were washed an extra time 
for 30 min in PBT and fixed quickly in 4% PFA 
for 15 to 20 min at room temperature. After 
staining was complete, DAPI (with or without 
DiOC,) staining and mounting was performed 
as described above. 


Confocal microscopy, image acquisition, 
and image processing 


Images were acquired on a Zeiss LSM 880 confo- 
cal microscope using a 40x lens. The acquisition 
setting was set between samples and experiments 
to: XY resolution = 1104 x 1104 pixels, 12-bit, 2x 
sampling averaging, pixel dwell time = 0.59 s, 
zoom = 0.8x, pinhole adjusted to 1.1 um of Z 
thickness. Increments between images in stacks 
were 0.5 3um, and laser power and gain were set 
in an antibody-dependent manner to 7 to 11% 
and 400 to 650, respectively, below saturation 
condition. Unless otherwise noted, images shown 
are partial sum Z-projection. Acquired images 
were not manipulated, and only contrast and 
brightness were linearly adjusted. All figures 
were made using Adobe Photoshop CC 2014. 


Confocal live and time-lapse imaging 
of cultured ovaries 


Live imaging was performed as in (24). Briefly, 
ovaries were dissected from juvenile fish (5 to 
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7 wpf, SL ~10 to 15 mm) into fresh warm HL-15 
medium [Hanks solution containing 60% L-15 
with no phenol red (Sigma-Aldrich) and 1:100 
Glutamax (Invitrogen) warmed to 28°C]. Ova- 
ries were then embedded in 0.5% low-melt 
agarose in HL-15 on a glass-bottom dish and 
covered with warm HL-15. After the agarose 
polymerized, ovaries were incubated in HL-15 
at 28°C. Time-lapse images of live ovaries were 
acquired using either a Zeiss LSM 880 or Nikon 
Ti2E spinning-disk confocal microscopes, both 
equipped with an environmental chamber set 
to 28°C and using 40x and 60x lenses, re- 
spectively. Images were acquired every 6 s for 
single Z sections or every 18 to 20 s for Z stacks 
for 5 to 10 minutes. 


Chromosome rotation tracking in live 
time-lapse images 


Live time-lapse imaging of Tg(h2a:h2a-gfp) 
cultured ovaries were performed as above, 
recording large frames of ovaries that in- 
cluded multiple zygotene oocytes, oogonia, 
and somatic pre-granulosa cells side by side 
and under identical conditions. Chromosomal 
dynamics in these cells were tracked on the basis 
of H2A-GFP intensities using the TrackMate 
plugin on ImageJ (98) with the following set- 
ting parameters. Blob diameter was set to 3 um 
with an appropriate threshold, and Linking 
distance was set to 4 um with a maximum 
permissibility of two frame gaps for gap closing 
and a gap-closing maximum distance of 1 um. 
Identified tracks were confirmed manually. 
Selected tracks were identified over 20 to 30 
consecutive time points; at least six consec- 
utive time points were selected, and images 
with at least two appropriately identified tracks 
were analyzed. For track visualization, spot 
sizes were set to 0.2 of the original spot radius 
ratio (~0.6 um), and tracks were set to show 
tracks backward and color-coded for time. Tracks 
were then recorded as an overlay on the original 
image using the “capture overlay” function. 
Track parameters, including velocity, displace- 
ment, and track duration, were exported to Ex- 
cel and GraphPad Prism for statistical analysis. 


Coloc2 analysis 


A pixelwise quantitative test for colocalization 
of AcTub with Arl13b and with GluTub signals 
was performed using the Coloc 2 plug-in on 
Fiji as in (18). Cilia regions of interest (ROIs) 
were taken along individual cilia from repre- 
sentative images. The shapes of the ROIs vary 
according to the cilia orientation: ROI of straight 
cilia are narrower, whereas ROI of bent or 
curled cilia are wider. However, the spaces 
taken around the cilia were consistent in all 
ROIs. We calculated a point spread function of 
~1.6 pixels on average for each channel and 
image based on PSF = 0.8 x exitation wave- 
length (nm)/objective’s numerical aperture]/ 
pixel size (nm). Minimum and maximum thresh- 
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olds were automatically set by the plug-in for 
each image. The Pearson correlation coefficient 
(R) was calculated. R values were interpreted 
as follows: -1 < R < 0 for anticorrelation; R = 0 
for no correlation; and 0 < R < 1 for correla- 
tion. The plug-in then scrambled the pixels 
to generate a random image and test for the 
probability to yield the same Pearson coeffi- 
cient from a random image. Scrambling was 
set to 100 iterations, generating 100 different 
random images from the tested image’s pixels. 
The probability of receiving the calculated 
Pearson coefficient values for both AcTub 
and Arl13b and AcTub and GluTub were P = 1. 
A pixelwise quantitative test for colocalization 
of GluTub with Cep55 was performed similarly. 


Image 3D reconstructions in IMARIS 


ROIs of whole cysts were extracted from con- 
focal raw data and used to reconstruct nuclei, 
cilia, and other cellular features with respect 
to each other using blend volume rendering 
mode. Signal brightness and object transparency 
were edited in all channels to optimize signal 
and reduce background. Animation frames were 
made using the key frame animation tool. 


SBF-SEM 


Resin-embedded ovaries were mounted onto 
a cryo pins using conductive silver epoxy, and 
targeted trimming was performed using an 
ultra-microtome (Leica) to expose specific areas 
of the ovaries. The trimmed block was painted 
with Electrodag silver paint and coated with 
10 nm AuPd using a Q150T sputter coater 
(Quorum Technologies). 

Cysts were imaged using a Gatan 3View 
mounted on a Quanta 250 SEM (FEI). Two 
areas containing cysts were imaged using the 
multiple ROI function, ROI-00 and ROI-01. 
Imaging conditions were as follows: for ROI 
00, magnification 3445x, pixel size 5.9 nm in x 
and y, 75 nm in zg, image dimensions 8189 x 
5932 pixels; for ROI 01, magnification 3480x, 
pixel size 5.9 nm in w and y, 75 nm in g, and frame 
width 8192x 5932. The chamber pressure was 
70 Pa. 2.8kV, dwell time per pixel 68 us for both 
ROIs. Before converting the images into TIFF 
format with Digital Micrograph, they were binned 
by two, giving a final pixel size in # and y of 12 nm. 


3D rendering of SBF-SEM images 


Renders were made from over 330 75-nm sec- 
tions for each cyst (movie $3). The image se- 
quence generated from SBF-SEM was imported 
into ImageJ. The imported image was then 
adjusted with the correct voxel size, bright- 
ness, contrast, and alignment. The stack was 
imported into TrakEM2 canvas for segmen- 
tation. In TrakEM2, a project tree was made 
by adding area lists. Once the area list was 
made, cells were manually segmented at every 
seventh to eighth slice using the brush tool of 
ImageJ, and the gap was filled by using the 
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interpolation function of TrakEM2. The seg- 
mented structure of the cyst was then exported 
as a TIFF file to IMARIS software, and image 
properties were set to the acquisition voxel 
size. Surfaces were created using the surface 
creation wizard, and color and transparency 
were set for individual cells. The segmented 
cells were then exported as “scenes.” This pipe- 
line was repeated for different structures of 
the cyst and exported a different scenes from 
the area lists. Then, the SBF-SEM image stack 
was imported to the arena area of IMARIS. 
The voxel size was corrected, and Scenes were 
imported. Once all the scenes were imported, 
color and transparency were set for individ- 
ual elements for optimal representation. The 
reconstruction was finally saved using the 
animation function of IMARIS (movies S83 
and S6). 


Rescue of ccd103 mutant embryonic lethality 


The mRNA of zebrafish ccdc103 was transcribed 
from template with the mMessage mMachine 
SP6 transcription kit (Invitrogen, AM1340). To 
rescue the mutants, a total of 0.5 nl of the mRNA 
with final concentration of 150 ng/ul was in- 
jected into the animal pole of one-cell-stage 
embryos. Rescued ccd103"/~females were con- 
firmed by genotyping and exhibited the char- 
acteristic scoliosis phenotype, which results 
from loss of ciliary motility from ependymal 
cells in the spine central canal. 


Measurements of cilia and CBs in SBF-SEM data 


The image sequence was imported as a stack 
in Fiji. The image voxel size was set as per 
the acquisition metadata by selecting im- 
age properties. Slices in which cytoplasmic 
bridges or cilia were widest were selected. 
Using the line tool from the Fiji menu bar, 
measuring lines were drawn across the width 
of the cilia or CBs. The lines drawn were 
color-coded red from overlay properties 
and added to the ROI manager. Three mea- 
surement lines were drawn across cilia and 
CB longitudinal sections. Two perpendic- 
ular lines were drawn across cilia cross- 
sections (along perpendicular diameters). 
All lines in ROI manager were measured, 
and the resulting table was saved as a CSV 
file, exported to Excel for calculating average 
width of CB and cilia, and graphical repre- 
sentation was produced using GraphPad 
Prism 8. 


Measurements of cilia length in confocal data 
and defining ciliary length categories in wt 
and mutant ovaries 


Ten representative cyst ROIs from each geno- 
type were cropped in Fiji and used for image 
processing in the IMARIS 3D software. The 
filaments IMARIS feature was used to mark 
the cilia pathways, as follows. To create the 
cilium filament, the auto-depth semi-automatic 
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IMARIS algorithm was used, combining user’s 
manual tracking and local signal intensity 
(fig. S6). After creating filaments for all cilia 
in a cyst, the filament lengths were extracted 
to Excel and GraphPad Prism for statistical 
analysis, and the images of cilia tracks were 
extracted as JPEG files (fig. S6). 

We first measured ciliary lengths in wt cysts 
in 3D as described above and defined ciliary 
length categories. wt cilia mostly ranged be- 
tween 4 and 8.5 um, with the longest cilia 
reaching 12 mm (Fig. 2A). We defined ciliary 
lengths at 4 to 8.5 um as “normal cilia” and 
cilia with outlier values <4 um as “short cilia.” 
Cilia that were only detected as an extremely 
short dot-like signal that was <1 um or not 
detected at all were defined as “absent cilia” 
(ND in Fig. 2, A and B). Individual cilia were 
measured as described above in cysts from all 
genotypes (Fig. 2A), and the average ciliary 
length per cyst was calculated (Fig. 2B). Cysts 
with average cilia lengths >4 um were scored 
as cysts with normal cilia, and cysts with av- 
erage cilia lengths <4 um were scored as cysts 
with short cilia (Fig. 2B). Distribution of scored 
cysts were plotted (Fig. 2C), entire ovaries were 
scored, and their distribution for the same 
categories was plotted (Fig. 2D). 


Bouquet analysis 


Zygotene bouquet stages span oocyte sizes of 
10 to 16 um in diameter (18, 24). We reasoned 
that the telomere cluster may be less compact 
during its early formation (oocyte sizes of 10 
to 11.5 um in diameter) and its later dissoci- 
ation (oocyte sizes 13 to 16 um in diameter). 
We therefore focused on midbouquet stages 
(oocyte sizes 11.5 to 13 um in diameter) that 
show clear clusters (18, 24). We co-labeled 
telomeres by Telo-FISH and the cytoplasm 
with DiOC, (Fig. 2G). We scored all oocytes 
blinded to the Telo-FISH channel, measured 
their size in diameter on the basis of the 
DiOC, cytoplasmic labeling according to (24), 
and selected oocytes at midbouquet sizes (11.5 
to 13 um). We then used the Telo-FISH chan- 
nel to analyze the distribution of telomeres in 
all Z sections spanning these entire oocytes. 
Categories were defined as “tight” for telomere 
clusters at one pole of the nucleus that spanned 
up to approximately 25% of the nuclear cir- 
cumference, “expanded” for wider clusters that 
spread beyond 25% of the nuclear circumfer- 
ence and up to the nuclear equator, and “dis- 
persed” for unclustered telomeres that spread 
over the nuclear equator, as shown in Fig. 2G. 
The analysis in fig. S7D, which addresses 
later bouquet stages, was performed identi- 
cally, but selected oocytes were at a size range 
of 13 to 15 um. 


Ciliobrevin treatment 


Ovaries were dissected from juvenile wt fish 
and cultured in HL-15 as described in the “Con- 
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focal live and time-lapse imaging of cultured 
ovaries” section above. The HL-15 medium was 
then replaced with HL-15 medium containing 
either DMSO or 15 or 50 uM ciliobrevin. Ovaries 
were incubated for 80 min at 28°C, and fixed for 
Telo-FISH, DiOC,, and DAPI labeling. Bouquet 
analysis was performed on 11.5- to 13-um oocytes 
as described above. 


Ciliary laser excision 


Ovaries were dissected from 6 wpf Tg(Bact: 
Arll13b-GFP);Tg(Bact:Cetn2-GFP) fish and 
mounted as described for live imaging above. 
Excisions were performed using a Leica TCS 
SP8 MP two-photon microscope with a 25x 
objective and equipped with an incubation 
chamber set to 28°C. An ROI with a good num- 
ber of cilia in the field of view was selected. The 
laser stimulation region for excision was drawn 
manually using the “draw ROI” function. Time- 
lapse was recorded for 60 s before ablation, fol- 
lowed by a laser stimulation (900 nm, 2.79 W) 
for 20 s, and time-lapse recording for 8 min after 
ablation (movies S13 and S14). Control experi- 
ments (movies S15 and S16) were performed 
side by side with the experimental ablations 
under identical conditions and settings, except 
laser power was set to 4 W for 20 s. 


Centrosome-nucleus distance measurements 


The distance between the centrosome and the 
nucleus was measured in wt and mutant ova- 
ries co-labeled for centrosome (y-Tub antibody) 
and telomeres (Telo-FISH) and counterstained 
with DAPI. Oocyte at the mid-zygotene bouquet 
stage were selected as in the bouquet analysis 
above (oocyte size 11 to 13 1m). For centrosome- 
nucleus distance measurements, the optical sec- 
tion that includes the center of the centrosome 
was selected, and the distance between the 
centrosome and the closest point on the nu- 
cleus surface was measured in Fiji. Measure- 
ments were tracked using the Fiji ROI manager. 
We then independently scored the telomere 
clusters of measured oocytes to the tight, ex- 
panded, and dispersed categories according to 
the bouquet analysis above. Centrosome-nucleus 
distances were normalized to oocyte size (diam- 
eter), plotted per genotype, and color-coded for 
telomere cluster categories (Fig. 3C). Pooled oo- 
cytes from all genotypes in this analysis were also 
plotted per telomere cluster category (Fig. 3D). 


Sycp3 phenotype analysis 

To analyze effects on Sycp3 in wt and mutant 
whole-mount ovaries labeled for Sycp3 and 
DAPI, we scored oocytes at three stages and 
sizes based on our wt prophase staging in fig. 
S1, B and C: early zygotene (10 to 12 um) and 
late zygotene (12 to 14 um). For each stage, we 
selected oocytes of the appropriate size range, 
and then scored the Sycp3 signal as either 
normal or defected (Fig. 3E) compared with 
wt and as shown in fig. S1, B and C. 
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Quantitative analysis of cyst morphology 
and integrity 
We defined phenotype categories for germline 
cyst morphology, as follows: The “loose cysts” 
category showed abnormal gaps between sister 
oocytes in cysts (Fig. 4A); the disintegrated 
cysts category phenotype was more severe, 
showing larger abnormal gaps between cyst 
oocytes (Fig. 4A) and clear elongated CBs 
extending between distant oocytes (Fig. 4A). 
Normally, at later stages when oocytes leave 
the cyst, their CBs are detected by MT label- 
ing and appear elongated as they disjoin 
from their sister oocytes (18). Within normal 
cysts, CBs are very challenging to detect by 
either MT (8) or cytoplasmic labeling with- 
out a specific marker (Fig. 4A). This is likely 
because the cytoplasmic membranes of 0o- 
cytes interface tightly, masking the fine and 
short structure of CBs (fig. S4). In disintegrated 
cysts, the elongated CBs are visible without 
labeling of MT or specific markers and are sim- 
ilar to the MT-detected CBs of oocytes that 
naturally leave the cyst later (78). They likely 
represent CBs of oocytes that are separating. 
For quantitative analysis of category pheno- 
types, and because borders between tight cells 
in wt cysts are difficult to detect, we measured 
the distances between the centers of all the 
neighboring nuclei in 3D in wt, cep290! “and 
cep290';kif7'!~ cysts (Fig. 4B, fig. SIOB, and 
movie S17). All distances were normalized to 
the average oocyte diameter per cyst (Fig. 4B 
and fig. SIOB, right panels). Five to eight rep- 
resentative cysts ROIs from each phenotypic 
category were cropped in Fiji and used for 
image processing in IMARIS 3D software. 
We used a semiautomatic segmentation ap- 
proach to create nuclei surfaces. Each nu- 
cleus was segmented in each Z layer to create 
a full 3D surface either by the software auto- 
matic segmentation (intensity based) or by 
marking the nucleus contour manually. After 
creating surfaces for all cyst nuclei, we used 
the measurement point feature to calculate 
the distances between the center of mass of 
all neighboring nuclei. Distance measure- 
ments were exported to an Excel sheet and 
normalized to the average oocyte size [diam- 
eter in micrometers, measured as in (24)] 
per given cyst. Normalized distances were 
exported to GraphPad Prism for statistical 
analysis between groups and generation of 
dot plots. Images of processed cysts were ex- 
ported from IMARIS. 


Correlation analysis of cyst integrity 
and ciliary phenotypes 


In all cyst morphology experiments, we co- 
labeled cilia with AcTub staining and simul- 
taneously analyzed cyst and cilia phenotypes. 
To analyze the correlation between cyst and 
ciliary phenotypes, we pooled gonads from 
across all examined genotypes and categorized 
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them by the effect on cilia, i-e., normal, short 
(truncated), or absent (as in Fig. 2, A to D) 
regardless of their genotype. We then inves- 
tigated how many cysts per ciliary phenotypic 
category were observed in each ovary and, 
based on this distribution, labeled ovaries as 
normal, mildly affected, or severely affected 
(n = 43 gonads, n = 10 to 60 cysts analyzed 
per gonad). We next categorized cyst pheno- 
types in these ovaries and considered dis- 
integrated and isolated oocytes categories as 
severe effects. In most ovaries, a certain cyst 
phenotype category was predominant in at 
least 50% of cysts. The ovary category was 
chosen according to this cyst category or to 
the most frequently detected one in few cases 
where no category was frequent in 50% of 
cysts. The frequency of cyst severity pheno- 
types was plotted for each ciliary phenotype 
(Fig. 4F). cc2d2a mutants were analyzed iden- 
tically (fig. S10C). 


Determining stages of spermatogenesis 


We identified all spermatogenesis stages in 
tubules based on their characteristic nuclear 
morphology and size using DAPI counterstain 
according to (99), as well as the prophase 
markers telo-FISH, Sycp3, and gH2Ax, as de- 
scribed in the text. Briefly, round spermatids 
had small round nuclei (2.5 + 0.1 1m in diame- 
ter) and were found toward the center of the 
tubule as expected. Spermatozoa had compact 
round nuclei (2.1 + 0.1 um in diameter) and 
were found in the lumen of the tubules as ex- 
pected. Early round spermatids had a nuclear 
diameter of 3 + 0.1 um. Round spermatids and 
spermatozoa were negative to all three prophase 
markers. Nuclear diameter of leptotene-zygotene 
spermatocytes and pachytene spermatocytes 
were 5.1 + 0.1 and 6 + 0.1 um, respectively, 
and both were positive for all prophase mark- 
ers, showing the expected pattern of each 
stage (23, 49). 


TEM 


Gonads were dissected from juvenile fish (5 to 
7 wpf, SL ~10 to 15 mm) and fixed as described 
above in PBS containing 4% PFA and 2.5% 
glutaraldehyde overnight at 4°C. Gonads were 
then rinsed four times for 10 min in 0.1 M 
cacodylate buffer, pH 7.4, and postfixed and 
stained with 1% osmium tetroxide, 1.5% po- 
tassium ferricyanide in 0.1 M cacodylate buffer 
for 1 hour. Ovaries were washed four times in 
cacodylate buffer, followed by dehydration in 
increasing concentrations of ethanol consist- 
ing of 30, 50, 70, 80, 90, and 95% for 10 min 
each step, followed by 100% anhydrous ethanol 
three times for 20 min each step, and propylene 
oxide two times for 10 min each step. After 
dehydration, the ovaries were infiltrated with 
increasing concentrations of Agar 100 resin in 
propylene oxide, consisting of 25, 50, 75, and 
100% resin for 16 hours each step. Gonads were 
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embedded in fresh resin and left to polymerize 
in an oven at 60°C for 48 hours. 

Embedded gonads in blocks were sectioned 
with a diamond knife on a Leica Reichert 
Ultracut S microtome, and ultrathin sections 
(80 nm) were collected onto 200-mesh, thin- 
bar copper grids. The sections on grids were 
sequentially stained with uranyl acetate for 
10 min, and with lead citrate for 10 min, and 
imaged with Tecnai 12 TEM 100 kV (Phillips) 
equipped with a MegaView II CCD camera 
and Analysisversion 3.0 software (Soft Imaging 
System). For TEM images, only the brightness 
and contrast were slightly adjusted in Adobe 
Photoshop. These adjustments did not affect 
the biological properties of the imaged cellular 
features. 


IVF 


IVF was performed as in (100). Briefly, sperm 
were collected from anesthetized males into 
Hank’s solution and stored on ice until eggs 
were collected. Sperm from two males of the 
same genotype were collected into a single 
tube and used to fertilize eggs collected from 
control wt, cep290"”, cep290"?”*, cc2ad2a”®, 
or armc9*™*homozygous females. Anesthe- 
tized wt or mutant females were placed in a 
dish and squeezed for egg collection. On hun- 
dred to 150 ul of sperm solution was added to 
the collected eggs and incubated for 20 s. One 
milliliter of E3 containing 0.5% fructose was 
added to activate sperm, gently mixed, and 
incubated for 2 min. Two milliliters of E3 was 
added, followed by 5 min of incubation. The 
dish was then flooded with E3 and placed in a 
28°C incubator until examination of fertiliza- 
tion rates. 


Mouse gonads and IHC 


The C57BL/6 inbred strain of mice was used 
for all experiments. Ovaries from fetal mice at 
embryonic day 14.5 were first fixed in trunk 
with 4% paraformaldehyde in PBS at room 
temperature for 2 hours before being dis- 
sected out and washed briefly in PBS. The 
ovaries were further permeabilized with 0.3% 
Triton X-100 in PBS for 30 min at room tem- 
perature, followed by blocking with 3% bovine 
serum albumin (BSA) in 0.3% Triton X-100 in 
PBS for 1 hour at room temperature. Primary 
antibody incubation was performed at 4 over- 
night. After washing for 20 min six times with 
0.3% Triton X-100 in PBS, the ovaries were 
incubated with secondary antibodies at room 
temperature for 3 hours. The stained ovaries 
were finally washed for 20 min six times with 
0.3% Triton X-100 in PBS and mounted in 70% 
glycerol. 

Testes from postnatal day 12 and 16 mice 
were fixed in the trunk with 4% paraformal- 
dehyde in PBS at room temperature for 2 hours, 
followed by dissection to isolate the testes. The 
whole testes were then processed with cryo- 
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embedding. Representative cross-sections of 
the testes were acquired at anterior, posterior, 
and middle regions with a thickness of 20 um. 
Testes cryo-sections on slides were first thawed 
and dried before drawing borders around the 
sections with a PAP pen (ab2601; Abcam). The 
slides were then fixed with 4% PFA for 15 min 
at room temperature in Coplin jars (all sub- 
sequent steps were performed in Coplin jars 
unless stated otherwise), rinsed twice with ice- 
cold PBS followed by permeabilization with 
0.2% Triton X-100 in PBS for 15 min. 

Next, slides were washed for 5 min three 
times in PBS and blocked with 2% BSA in PBS 
for 2 h at room temperature. The slides were 
then transferred to a humidified box. Primary 
antibodies in PBS (with 0.1% Tween 20 and 1% 
BSA) were pipetted onto the sections and in- 
cubated overnight at 4°C. Slides were washed 
with PBS on a shaker for 10 min six times at 
room temperature. Secondary antibodies in 
PBS (with 0.1% Tween 20 and 1% BSA) were 
then added, and the slides were incubated in 
the humidified box for 5 h at room temperature. 
Finally, the slides were washed for 10 min six 
times with PBS at room temperature, dried, and 
mounted before imaging. 

Primary antibodies used to stain mice ova- 
ries or testes cryo-sections were as follows: 
AcTub (1:500; Sigma-Aldrich), glutamylated 
tubulin (1:500; Adipogen), Vasa (1:500; R&D 
Systems), yTub (1:500; Adipogen), Arl13b 
(1:500; Proteintech), and Adcy3 (1:500; LSBio). 
Alexa Fluor fluorescent secondary antibodies 
Cnvitrogen) were used at 1:500. 


Statistical analysis 


All statistical analysis and data plotting was 
performed using GraphPad Prism 7 software. 
Datasets were tested with two-tailed unpaired 
t test. In the figures, P values are indicated by 
asterisks as follows: *P < 0.05, **P < 0.01, ***P < 
0.001, and ****P < 0.0001 (ns, not significant 
at P > 0.05). All bar graphs were tested with 
chi-squaredFisher’s exact test, and P values 
were at least <0.0001 between wt and cep2907! ze 
cep290" ;kif7"” , or cep2907;kif7 unless oth- 
erwise indicated. 
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INTRODUCTION: Staphylococcus aureus is a bac- 
terial pathogen with a global impact on human 
health. Most individuals carry S. aureus on 
their skin and in their nostrils, but a minority 
develop life-threatening staphylococcal disease. 
The molecular basis of interindividual clinical 
variability upon exposure to and infection with 
S. aureus is unclear. Inherited or acquired dis- 
orders affecting innate, myeloid immunity con- 
fer a predisposition to staphylococcal disease 
but account for only a small proportion of 
cases. Most cases of severe staphylococcal dis- 
ease remain unexplained. 


RATIONALE: Single-gene inborn errors of immu- 
nity are being implicated in a growing num- 
ber of life-threatening infectious diseases. 
We performed a genome-wide study to iden- 
tify the human genetic determinants of severe 
staphylococcal disease. We tested for genetic 
homogeneity in a cohort of patients with un- 
explained life-threatening staphylococcal dis- 
ease by searching for variant enrichment in 
the exomes of 105 cases and 1274 controls. We 
considered rare variants and tested an auto- 
somal dominant mode of inheritance with in- 
complete penetrance. 


OTULIN haploinsufficiency ORAS 
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OTULIN haploinsufficiency is characterized by skin and pulmonary necrosis after S. aureus infection. The 
disorder is phenocopied in patients with 5p -syndrome. The cytotoxic damage to fibroblasts inflicted by the 
staphylococcal a-toxin is facilitated by caveolin-1 accumulation and ADAM1O0 retention at the cell surface. 
OTULIN haploinsufficiency is rescued by naturally elicited o-toxin—neutralizing antibodies. Autoinflammation in 
biallelic OTULIN deficiency is driven by excessive NF-«B activation in myeloid cells. WT, wild-type allele; 

MT, mutant allele; del, deletion; LUBAC, linear ubiquitin assembly complex; M1-Ub, M1-linked ubiquitin. 
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RESULTS: We found enrichment for heterozy- 
gous OTULIN variants in patients with severe 
staphylococcal disease. OTULIN is a linear 
deubiquitinase and negative regulator of nu- 
clear factor kB (NF-«B) signaling encoded by 
a gene on chromosome 5p. Biallelic OTULIN 
mutations cause an early-onset autoinflam- 
matory condition called OTULIN-related auto- 
inflammatory syndrome (ORAS). Probands 
heterozygous for deleterious OTULIN variants 
suffered from life-threatening skin or pulmo- 
nary necrosis. Their disease was typically trig- 
gered by S. awreus infection and occurred from 
adolescence onward. Clinically, penetrance 
was incomplete, and expressivity was variable. 
The dominance mechanism was haploinsuf- 
ficiency, which was both biochemically and clin- 
ically phenocopied in patients with the more 
common 5p- (Cri-du-Chat) chromosomal dele- 
tion syndrome. Blood leukocyte subsets were 
developmentally and functionally unaffected. 
In dermal fibroblasts, OTULIN haploinsuffi- 
ciency increased the levels of linear ubiquitin, but 
tumor necrosis factor (TNF) receptor-mediated 
NF-«B signaling remained intact. Through cross- 
talk with CYLD, OTULIN haploinsufficiency 
caused the accumulation of caveolin-1 complexes 
modified with lysine-63-linked polyubiquitin 
(K63-Ub) chains. Caveolin-1 accumulation in 
the patients’ dermal fibroblasts, but not leuko- 
cytes, enhanced the cytotoxicity of the staphy- 
lococcal virulence factor a-toxin. Upon a-toxin 
binding to its receptor, disintegrin and met- 
alloprotease domain-containing protein 10 
(ADAM10), the receptor was retained at the cell 
surface, enhancing cytotoxicity in the patients’ 
dermal fibroblasts. Impaired cell-intrinsic im- 
munity to o-toxin owing to OTULIN haploin- 
sufficiency was rescued by a-toxin-neutralizing 
antibodies. 


CONCLUSION: By disrupting cell-intrinsic im- 
munity to a-toxin in fibroblasts and, perhaps, 
other nonleukocytic cells, haman OTULIN 
haploinsufficiency underlies life-threatening 
staphylococcal disease of the skin and lungs. 
Naturally elicited a-toxin-neutralizing anti- 
bodies in heterozygotes may contribute to in- 
complete clinical penetrance. The study of 
OTULIN haploinsufficiency, a rare inborn error 
of immunity, helped clarify a phenotype seen 
in individuals with 5p— syndrome, a more com- 
mon chromosomal abnormality. 
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The molecular basis of interindividual clinical variability upon infection with Staphylococcus aureus is 
unclear. We describe patients with haploinsufficiency for the linear deubiquitinase OTULIN, encoded by a 
gene on chromosome 5p. Patients suffer from episodes of life-threatening necrosis, typically triggered 
by S. aureus infection. The disorder is phenocopied in patients with the 5p- (Cri-du-Chat) chromosomal 
deletion syndrome. OTULIN haploinsufficiency causes an accumulation of linear ubiquitin in dermal 
fibroblasts, but tumor necrosis factor receptor-mediated nuclear factor «B signaling remains intact. 
Blood leukocyte subsets are unaffected. The OTULIN-dependent accumulation of caveolin-1 in dermal 
fibroblasts, but not leukocytes, facilitates the cytotoxic damage inflicted by the staphylococcal virulence 
factor a-toxin. Naturally elicited antibodies against a-toxin contribute to incomplete clinical penetrance. 


Human OTULIN haploinsufficiency underlies life-threatening staphylococcal disease by disrupting 
cell-intrinsic immunity to a-toxin in nonleukocytic cells. 


taphylococcus aureus is a major bacterial 

pathogen with a global impact on human 

health (7). Individuals with staphylococ- 

cal disease may suffer from a range of 

superficial (folliculitis, cellulitis, abscesses) 
to severe (e.g., necrotizing skin and soft tissue 
infections and necrotizing pneumonia, with or 
without septicemia) manifestations (1). Most 
individuals carry S. awreus asymptomatically 
on their skin and in their nostrils for long pe- 
riods of their lives, but only a minority develop 
life-threatening staphylococcal disease. Se- 
vere disease has a poor prognosis, owing to its 
rapid invasive course (2-5). Acquired risk fac- 
tors for staphylococcal disease include surgery 
and intravascular devices (1). Life-threatening 
staphylococcal disease can also result from 
single-gene inborn errors of immunity (IEIs) 
(6). Disorders affecting phagocyte develop- 
ment or function, such as severe congenital 
neutropenia, chronic granulomatous disease, 
and leukocyte adhesion deficiency, confer a 
predisposition to staphylococcal disease (6, 7). 
Disorders of the Toll-like receptor (TLR) and 
interleukin-1 receptor (IL-1R) nuclear factor 
«B (TIR-NF-«B) pathway (8-12) and of IL-6- 
and signal transducer and activator of tran- 
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scription 3 (STAT3)-dependent immunity 
(13-17) have also been identified in patients 
suffering from severe staphylococcal disease. 
All these defects preferentially affect innate, 
myeloid immunity, but, collectively, they ac- 
count for only a small proportion of cases (78). 
Most cases of severe staphylococcal disease 
remain unexplained, because they strike other- 
wise healthy individuals with no detectable 
phagocyte defect (2, 19). In this study, we 
aimed to elucidate human genetic and immu- 
nological etiologies of life-threatening staphy- 
lococcal disease. 


Results 
Genome-wide enrichment in rare OTULIN variants 


Given the severity of their disease, we hypoth- 
esized that there would be at least some ge- 
netic homogeneity in our cohort of patients 
with unexplained life-threatening staphylo- 
coccal disease. We sequenced the exomes of 
105 index cases. Given the rarity of severe 
staphylococcal disease in otherwise healthy 
individuals, and assuming a dominant mode 
of inheritance, we hypothesized that the 
disease-causing variants would be very rare 
[minor allele frequency (MAF) < 1 x 107°] (20). 
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We also prioritized variants predicted to be 
deleterious [combined annotation-dependent 
depletion score (CADD) > mutation significance 
cutoff (MSC)] (27-23). We performed the same 
analysis on 1274 control exomes from patients 
with mycobacterial diseases (including indi- 
viduals with Mendelian susceptibility to myco- 
bacterial disease or those with tuberculosis), 
which rarely overlap with staphylococcal dis- 
ease (24). After adjusting for ethnicity by prin- 
cipal components analysis (PCA), we used the 
results of these two analyses to test the null 
hypothesis that variants of a given gene were 
not specific to staphylococcal disease. OTULIN 
was the only gene satisfying the threshold 
for statistical significance on a genome-wide 
level, with a P value of 5.74 x 10” (Fig. 1A and 
tables S1 and S2). We repeated our analysis 
for OTULIN, adding 2504 individuals from 
the 1000 Genomes Project (25) to our control 
dataset (26); none of these individuals had 
developed life-threatening S. aureus infections. 
The enrichment in OTULIN variants was even 
stronger (P = 3.85 x 10°) in this analysis, indi- 
cating that very rare variants of this gene were 
found specifically in patients with severe staphy- 
lococcal disease. 


The patients carry heterozygous OTULIN variants 


The linear deubiquitinase OTULIN is a nega- 
tive regulator of inflammation (27, 28). In 
humans, biallelic OTULIN mutations cause a 
potentially fatal early-onset autoinflamma- 
tory condition called OTULIN-related auto- 
inflammatory syndrome (ORAS) (29-33). Closer 
inspection revealed that the signal for genetic 
homogeneity in the cohort of patients with 
severe staphylococcal disease was driven by 
three probands carrying heterozygous variants 
of OTULIN (Fig. 1B): one missense (p.D246V), 
one nonsense (p.E95X), and one frameshift 
(p.D268TfsX5) variant. In each of these pro- 
bands, the clinical hallmark of disease after 
infection with S. aureus was life-threatening 
necrosis of the skin and/or lungs [fig. SLA; 
see supplementary materials (SM)]. Given the 
extreme inflammatory responses to infection 
with S. aureus in these patients, we investi- 
gated the possibility of other triggers gen- 
erating similar disease. We identified three 
additional kindreds with very rare heterozy- 
gous missense variants of OTULLN (p.N341D, 
p.P254S, and p.R263Q) (Fig. 1C). The probands 
of these kindreds presented severe diseases 
triggered by infectious and unknown etiologies 
(fig. S1B; SM). The clinical course of disease in 
patients with no documented infection suggests 
that low-grade infections, or noninfectious 
triggers, might induce an inflammatory phe- 
notype similar to that of patients with doc- 
umented S. aureus infection (fig. S1, A and 
B; SM). In total, we identified six very rare 
variants in seven patients from six kindreds 
(fig. SIC). These seven patients did not carry 
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variants fitting the known modes of inheri- 
tance for any of the 430 genes already im- 
plicated in IEIs (7, 34). In most patients, the 
first episode of disease occurred during ado- 
lescence (SM). Clinical data indicated a degree 
of phenotypic heterogeneity in the index cases, 
but with the skin and/or lungs consistently 
affected in all cases (Fig. 1, B and C, and fig. S1, 
A and B; SM). One-third of the heterozygous 
relatives (three of the nine for whom data 
were available) expressed a related but milder 
phenotype, the other six carriers being appar- 
ently healthy (Fig. 1, B and C; SM). We then 
hypothesized that the parents of the patients 
with biallelic OTULIN deficiency reported in 
previous studies (29-32), who themselves 
carried deleterious OTULIN alleles in the 
heterozygous state, might present pheno- 
copies of the disorder observed in the patients 
of our cohort. One-third of the parents (3 of 10) 
did indeed express the phenotype (Fig. 1D; 
SM). Thus, very rare heterozygous mutations 
of OTULIN confer predisposition to severe 
necrosis of the skin and lungs, typically, but 
not exclusively, after infection with S. aureus, 
with variable expressivity and incomplete 
penetrance. 


OTULIN is subject to negative selection 


The severity of disease in the probands carry- 
ing very rare heterozygous OTULIN variants 
suggests that this gene is subject to evolution- 
ary forces acting at the population level. The f 
parameter score for OTULIN is 0.36, indicat- 
ing that this gene is under negative selection 
(fig. SID) (35). Moreover, the consensus score 
for negative selection (CoNeS) for OTULIN is 
-0.78, within the reported range for genes 
underlying IEIs with both autosomal dom- 
inant (AD) and autosomal recessive (AR) in- 
heritance (fig. SIE) (36). OTULIN has a LoFtool 


score of 0.115, suggesting that it does not 
tolerate haploinsufficiency (37). Predicted 
loss-of-function (pLOF) OTULIN variants are 
very rare in the general population, with a 
cumulative MAF of 1 x 10~* (fig. SIF) (20). 
Consistent with the negative selection pres- 
sure acting on OTULIN, the alleles of the pa- 
tients were found to be either ultra-rare or 
private (fig. SIF) (20). These population ge- 
netics parameters indicate that heterozygous 
pLOF variants of OTULIN are disadvantageous 
for the individual. This finding is consistent 
with the hypothesis that the very rare variants 
for which enrichment was detected in the pa- 
tients studied are causal for the life-threatening 
necrosis of the skin and/or lungs triggered by 
S. aureus infection. 


The OTULIN alleles of the patients are severely 
hypomorphic or amorphic 


We overexpressed the cDNAs corresponding 
to the OTULIN alleles of the patients in hu- 
man embryonic kidney (HEK) 293T cells. The 
truncated cDNAs of the patients demonstrated 
loss of expression, whereas the missense cDNAs 
of the patients and cDNAs corresponding to 
two OTULIN alleles commonly found in the 
general population (20) were expressed at nor- 
mal levels (fig. SIG). No in-frame reinitiation 
was observed for truncated alleles from the 
patients (fig. SIH). We then assessed the de- 
ubiquitinase activity of the products of the 
alleles from the patients and their capacity to 
inhibit NF-«B signaling. Consistent with pub- 
lished findings in overexpression systems, 
some, but not all, protein products of the 
alleles concerned were defective for the linear 
deubiquitination of NF-«B essential modula- 
tor (NEMO) in the presence of linear ubiquitin 
assembly complex (LUBAC) (fig. SID (32, 33). 
We assessed the capacity of the gene products 


to regulate receptor signaling, by investigating 
the functional consequences of the OTULIN 
alleles of the patients for NF-«B inhibition 
after stimulation with TNF in a cellular sig- 
naling system. All the alleles from patients 
were severely hypomorphic or amorphic (fig. 
S1J). In contrast to the alleles of the patients, 
the two alleles common in the general pop- 
ulation (20) were found to be isomorphic (fig. 
S1J). The apparent differences between the 
two overexpression systems may reflect differ- 
ent impacts of the various OTULIN alleles on 
ubiquitin binding capacity (27, 32, 33). We 
then performed functional tests for all the 
other OTULIN coding sequence variants re- 
ported in public databases (NV = 120 variants) 
(20). The cumulative MAF of amorphic and 
hypomorphic OTULIN alleles (with <25% wild- 
type levels of NF-«B signaling inhibitory activ- 
ity) in the general population was 6 x 10° 
(Fig. 1E). The greater rarity of amorphic and 
hypomorphic OTULIN alleles than of pLOF 
variants highlights the importance of experi- 
mental validation for pLOF variants. Thus, the 
rare OTULIN variants found in the patients 
are deleterious, and AD OTULIN deficiency 
predisposes the patients to disease. 


Autosomal dominant OTULIN deficiency acts 
through haploinsufficiency 


We tested the genetic mechanism of AD 
OTULIN deficiency and found no negative 
dominance for the alleles expressed by the 
patients (fig. SIK). This implies that the ge- 
netic mechanism underlying AD OTULIN 
deficiency is haploinsufficiency. OTULIN is 
located on the short arm of chromosome 5. 
Depending on the breakpoint, almost all indi- 
viduals with 5p— syndrome—the most common 
chromosomal deletion syndrome in humans, 
also known as Cri-du-Chat syndrome—are 
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Fig. 1. OTULIN haploinsufficiency and its molecular characterization. 

(A) Genome-wide enrichment in rare, predicted deleterious variants in the cohort 
of 105 patients with severe staphylococcal disease, relative to 1274 patients 
with mycobacterial disease. (B) Pedigrees of the kindreds presenting severe 
necrotizing staphylococcal disease and carrying heterozygous mutations 

of OTULIN. (C) Pedigrees of the kindreds carrying heterozygous mutations of 
OTULIN and presenting necrosis triggered by other etiologies. (D) Pedigrees 

of the kindreds of ORAS probands carrying biallelic mutations of OTULIN. 

(E) Functional population genetics based on MAFs reported in gnomAD and 


haploinsufficient for OTULIN by definition 
(38-40). The prevalence of 5p— syndrome is 
~1in 50,000 live births, similar to the cumu- 
lative MAF for amorphic and hypomorphic 
OTULIN alleles (Fig. 1E) (41). Respiratory 
tract infections are one of the principal causes 
of hospitalization in individuals with 5p- 
syndrome, and pneumonia is among the most 
common causes of death in these individuals 
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and tables S1 and S2. 


(40, 42-44). We recruited six 5p— syndrome 
patients with a breakpoint centromeric to 
OTULIN (Fig. 1F and fig. SIL). Expressivity 
was variable and penetrance was incomplete, 
but one-third of these OTULIN-haploinsufficient 
5p-— syndrome patients (two of six) presented 
an age-dependent phenocopy of the disorder 
observed in patients with heterozygous OTULIN 
mutations (Fig. 1F; SM). We also identified 
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the NF-«B inhibitory capacity of the OTULIN variants, as assessed with a 
luciferase reporter system in transiently transfected HEK293T cells stimulated 
with TNF (data points represent the mean of N = 4 to 5 per variant). 

(F) Pedigrees of the kindreds of the 5p- syndrome probands carrying deletions 
with a breakpoint centromeric to OTULIN. The patients are shown in ascending 
age order (left to right, top to bottom). (G) Pedigree of the 5p— syndrome 
kindred with probands carrying a deletion with a breakpoint telomeric to OTULIN. 
WT, wild-type allele; MT, mutant allele; LOF, loss of function. See also fig. S1 


two individuals from a multigeneration kindred 
affected by 5p— syndrome with an extremely 
rare breakpoint telomeric to OTULIN (Fig. 1G 
and fig. S1L) (38). Both these individuals carried 
two copies of OTULIN and did not display the 
phenotype studied here (SM). The clinical 
similarities between patients with heterozy- 
gous OTULIN mutations and those with 5p-— 
syndrome suggest a common mechanism of 
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predisposition to infection due to haploinsuf- 
ficiency for OTULIN. 


Immunological characterization 
of OTULIN-haploinsufficient PBMCs 


The autoinflammatory features seen in ORAS 
patients are largely driven by abnormally high 
levels of NF-«B activation in myeloid cells 
(29, 30, 33). We thus investigated whether 
patients with OTULIN haploinsufficiency 
presented any related immunological dis- 
turbances. OTULIN expression in monocytes 
differed between patients and healthy controls 
(fig. S2A). Routine immunological tests per- 
formed in diagnostic laboratories [including 
assessments of leukocyte differentiation and 
oxidative burst capacity—deficiencies of which 
are known to confer a predisposition to staphy- 
lococcal disease (6, 7)] revealed no explanatory 
defects in patients with OTULIN haploinsuffi- 
ciency (SM). We thus looked for more-subtle 
immunological features by testing periph- 
eral blood mononuclear cells (PBMCs) from 
the patients using mass cytometry [cytome- 
try by time-of-flight (CyTOF)] (45) and RNA 
sequencing. In comparisons with healthy 
controls, we observed no differences in the 
abundance of leukocyte subsets or in the ex- 
pression of lineage markers (Fig. 2A and fig. 
82, B to D). The development of myeloid and 
lymphoid subsets in patients with OTULIN 
haploinsufficiency was, therefore, apparently 
normal. At baseline, we observed a complex 
transcriptional signature not associated with 
NF-«B-driven inflammation in the PBMCs of 
the patients (Fig. 2B and fig. S2E). This dif- 
ference between the patients and healthy con- 
trols disappeared after stimulation (Fig. 2B). 
Both at baseline and after stimulation, the 
PBMCs of the patients had a capacity to secrete 
various cytokines, including TNF and IL-1, 
similar to that of PBMCs from healthy con- 
trols (fig. S2F). We detected no defect of IL-6 


Fig. 2. OTULIN haploinsufficiency 
does not impair the hematopoietic 
immune system. (A) Aggregate uniform 
manifold approximation and projection 
(UMAP) plots of PBMCs from patients 
with AD OTULIN deficiency and healthy 
controls, as assessed by CyTOF. 

NK, natural killer T cells; DN T, double- 
negative T cells; MAIT, mucosal- 
associated invariant T cells; B, B cells; 
pDC, plasmacytoid dendritic cells; cDC, 
conventional dendritic cells. (B) Principal 
components analysis plot of the tran- 
scriptional profile of PBMCs at baseline 
and after incubation with various stimuli, 
in comparison with healthy controls. 
Arrows have been added to illustrate the 
direction of variance after stimulation. 


UMAP 2 


production (fig. S2F), deficiencies of which 
are known to be associated with staphylo- 
coccal disease (6). Single-cell RNA sequencing 
revealed a contribution of CD14* monocytes 
to the transcriptional signature in the PBMCs 
of the patients relative to healthy controls 
(fig. S2, G and H). However, consistent with 
their capacity to produce normal amounts of 
TNF and IL-1f (fig. S21), the amounts of phos- 
phorylated P65 in CD14* monocytes from 
the patients were normal both at baseline 
and after stimulation (fig. S2J). The CD14* 
monocytes of patients with OTULIN hap- 
loinsufficiency thus did not appear to be 
functionally affected. Finally, induced pluri- 
potent stem cell (iPSC)-derived macrophages 
from patients had normal oxidative burst 
and phagocytic capacities relative to healthy 
controls (fig. S2, K and L). These observations 
suggest an absence of overt immunological 
disturbance in patients with OTULIN hap- 
loinsufficiency, contrasting with the auto- 
inflammatory markers described in ORAS 
patients (29-33). 


OTULIN gene dosage—dependent accumulation 
of linear ubiquitin in fibroblasts 


We next performed a biochemical characteri- 
zation of the nonhematopoietic compart- 
ment in the patients and assessed the levels of 
OTULIN expression in primary dermal fibro- 
blasts (PDFs). OTULIN mRNA levels varied 
(fig. S3, A and B), but the levels of the cor- 
responding protein were low in the patients 
and depended on the nature of the mutation 
(Fig. 3, Aand B). OTULIN haploinsufficiency 
led to an accumulation of aggregates contain- 
ing linear ubiquitin (M1-Ub) in immortalized 
fibroblasts from patients, including 5p- 
syndrome patients with a breakpoint centro- 
meric to OTULIN (Fig. 3, C and D, and fig. S3C). 
These aggregates were sensitive to treatment 
with exogenous OTULIN (fig. S3D), and their 


CD14* Monocytes 
CD141- CDic- mDC 
Intermediate Monocytes 
CD16* Monocytes 

pDC 

cDC1 

@ Plasmablast @ cDC2 


NS, not stimulated; HKSA, heat-killed S. aureus; LPS, lipopolysaccharide. See also fig. S2. 
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accumulation was rescued by genetic comple- 
mentation with wild-type OTULIN (Fig. 3E 
and fig. S3E). We used a combination of an 
M1-Ub-selective tandem Ub-binding entity 
(TUBE) and absolute quantification (AQUA) 
tandem mass spectrometry (MS/MS) to as- 
sess M1-Ub levels in immortalized fibro- 
blasts from the patients. M1-Ub levels were 
much higher in the cells of both ORAS and 
OTULIN-haploinsufficient patients than in 
those of healthy controls and HOIL1-deficient 
patients (Fig. 3F) (46). Thus, OTULIN haplo- 
insufficiency and recessive deficiency result 
in the cellular accumulation of M1-Ub in a gene 
dosage-dependent manner. 


Normal TNF receptor signaling 
in OTULIN-haploinsufficient fibroblasts 


In the PDFs of ORAS patients, expression 
of the components of LUBAC decreases to 
compensate for recessive OTULIN deficiency 
(Fig. 3A) (29, 30, 32). This compensatory 
mechanism impaired TNF receptor signaling 
in immortalized fibroblasts from ORAS pa- 
tients (fig. S3F). By contrast, OTULIN haplo- 
insufficiency did not lead to a loss of LUBAC 
expression in PDFs (Fig. 3A) or to impaired 
TNF receptor signaling (fig. S3F) or IL-6 and 
IL-8 secretion (fig. S3G). Indeed, no differen- 
tial transcription patterns were evident in the 
patients’ PDFs after stimulation with TNF (fig. 
S3H). The impairment of early NF-«B activa- 
tion sensitizes PDFs from ORAS and LUBAC- 
deficient patients to TNF-induced apoptotic 
cell death under stress conditions (29, 46). 
However, consistent with their normal LUBAC 
expression, PDFs from patients with OTULIN 
haploinsufficiency were not susceptible to 
stress-induced apoptosis upon exposure to 
TNF (fig. S31). Thus, the functional dysregu- 
lation of TNF receptor signaling via NF-«B is 
limited to fibroblasts displaying a recessive 
OTULIN deficiency. 
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Global dysregulation of transcription 

in OTULIN-haploinsufficient fibroblasts 

The presence of a biochemical phenotype in 
the absence of an immunological phenotype 
after stimulation in OTULIN-haploinsufficient 


fibroblasts led us to investigate transcriptional 
homeostasis in resting PDFs from the patients. 
Consistent with the gene dosage-dependent 
accumulation of M1-Ub in immortalized fibro- 
blasts, this approach revealed a continuous 
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Fig. 3. OTULIN gene dosage-dependent accumulation of M1-Ub in fibroblasts. 
(A) Expression of OTULIN, HOIP, HOIL1, and SHARPIN in PDFs. (B) Relative 
abundance of OTULIN peptides in PDFs, as determined by mass spectrometry 
analysis (median). (©) Accumulation of aggregates containing M1-Ub in 
immortalized fibroblasts, as assessed by immunohistochemistry. Representative 
images. (D) Quantification of the aggregate accumulations seen in (D) (median). 
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genotype-dependent transcriptional pheno- 
type in the basal state (Fig. 4A). Despite an 
absence of functional consequences of OTULIN 
haploinsufficiency (fig. S3, F to I), specific 
gene set enrichment analyses of the resting 
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(E) Quantification of aggregates containing M1-Ub in immortalized fibroblasts after 
rescue with wild-type OTULIN, as compared with cells rescued with an empty 
virus (mean + SD). (F) Abundance of M1-Ub, as determined by AQUA-MS/MS 
analysis of immortalized fibroblasts (median). Statistical significance was 
calculated by analysis of variance (ANOVA) with Dunnett's post hoc correction 
for multiple comparisons. See also fig. S3. 
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transcriptome revealed a subtle signal for TNF 
receptor signaling via NF-«B (fig. S4A). These 
analyses also revealed a transcriptional signa- 
ture affecting various other cellular processes 
relative to cells from healthy controls (fig. S4A) 
(47). The affected processes included the MYC 
and E2F transcription factor systems and the 
G2/M cell cycle checkpoint system, the gene sets 
for these systems displaying a partial overlap 
(fig. S4B). These findings indicate a global dys- 
regulation of transcription in resting OTULIN- 
haploinsufficient fibroblasts. 


Accumulation of caveolin-1 
in OTULIN-haploinsufficient fibroblasts 


The observed patterns of transcription could 
not be explained by the known function of 
OTULIN as a linear deubiquitinase. We thus 
hypothesized that OTULIN haploinsufficiency 
affects cellular homeostasis posttranslationally, 
in an as yet unknown manner. The proteome 
of the resting PDFs from the patients dis- 
played a continuous cellular phenotype, con- 
sistent with the transcriptome (Fig. 4B and fig. 
S4C). Focusing on this continuous proteomic 
phenotype, we observed a pattern of differ- 
ential abundance for a cluster of 11 proteins 
including caveolin-2 and gelsolin (fig. S4, D to 
F). A STRING-interaction network analysis of 
this protein cluster identified a node connect- 
ing caveolin-2 and gelsolin: caveolin-1 (Fig. 4C). 
Conventional techniques revealed the accu- 
mulation of SDS-resistant high-molecular 
weight (MW) caveolin-1-containing complexes 
(48, 49) in PDFs from the patients in compar- 
ison with those from healthy controls (Fig. 4, D 
and E). We analyzed caveolin-1 immunopur- 
ification products (IPs) from PDF whole-cell 
lysates (WCLs) from one healthy control and 
one patient with recessive OTULIN deficiency 
by liquid chromatography and tandem mass 
spectrometry (LC-MS/MS). Validating the 
STRING-interaction network (Fig. 4C), both 
sets of caveolin-1 IPs also contained caveolin-2, 
whereas gelsolin was detected in the OTULIN- 
deficient sample only (fig. S4G). Caveolin-1 lev- 
els vary between cell types and are highest in 
structural cells, such as fibroblasts (50, 57). 
By contrast to our observations for PDFs, 
caveolin-1 was barely detectable in hemato- 
poietic cells, and the abundance of caveolin-1 in 
monocytes did not differ between OTULIN- 
haploinsufficient patients and healthy con- 
trols (fig. S4H). We detected no accumulation 
of caveolin-1-containing complexes in PDFs 
from the 5p- syndrome patient carrying two 
copies of the OTULIN gene relative to healthy 
controls (Fig. 4E and fig. S41). However, as in 
PDFs from the patients, caveolin-1 accumu- 
lated as a high-MW complex in an isogenic 
OTULIN-knockout PDF cell line (Fig. 4F and 
fig. S4J). Thus, OTULIN haploinsufficiency 
results in a selective accumulation of high- 
MW caveolin-1 in PDFs. 
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Lysine-63-linked polyubiquitination of caveolin-1 
The differences in gelsolin abundance in PDFs 
were attributed to differences in GSN tran- 
scription, but no differential transcription 
was detected for CAV/ and CAV2 (fig. S5A). 
Caveolin-1 is a key component of cell mem- 
brane microdomains, in which it acts as a scaf- 
fold for other proteins and receptors (47, 52). A 
signal for plasma membrane disturbance was 
detected at the proteomic but not the tran- 
scriptional level in the patients’ PDFs (fig. 
S5B), suggesting a posttranslational role for 
OTULIN in the regulation of caveolin-1 ex- 
pression in PDFs. Caveolin-1 is decorated with 
ubiquitin (48, 49, 53). We hypothesized that 
OTULIN haploinsufficiency and recessive 
deficiency affect the ubiquitination status of 
caveolin-1. We found that caveolin-1 IPs from 
PDF-WCLs from one healthy control and one 
patient with recessive deficiency of OTULIN 
contained a high-MW ubiquitin-containing 
complex (fig. S5C). We then used LC-MS/MS 
to characterize the ubiquitination profiles of 
purified caveolin-1 in solution, in the high- 
MW complex fraction, and in the monomeric 
fraction. Lysine-48-linked polyubiquitin (K48- 
Ub) chains were conjugated to the high-MW 
caveolin-1-containing complex regardless of 
genotype (fig. S5D). We detected no M1-Ub 
in any of the caveolin-1 IP fractions (fig. S5, 
C and D). Instead, the high-MW caveolin- 
1-containing complex was abundantly dec- 
orated with lysine-63-linked polyubiquitin 
(K63-Ub) chains in the OTULIN-deficient 
sample but not in the healthy control (Fig. 
5A and fig. S5C) (49). No other branched 
polyubiquitin linkages were detected in the 
caveolin-1 IP fractions. The remaining un- 
modified ubiquitin was probably conjugated 
as a monomer to caveolin-1 (fig. S5D) (53). 
The OTULIN-dependent accumulation of 
caveolin-1 complexes modified with K63-Ub 
but not M1-Ub chains indicates cross-talk be- 
tween OTULIN and other ubiquitin ligases 
and/or hydrolases. 


CYLD binds linear ubiquitin 


We characterized the cross-talk between 
OTULIN and K63-Ub chains by treating pu- 
rified caveolin-1 from one ORAS patient with 
a panel of recombinant deubiquitinases. Con- 
sistent with the respective specificities of the 
various hydrolases, the K63-linked polyubi- 
quitination of high-MW caveolin-1 was re- 
duced by treatment with recombinant CYLD 
but not OTULIN (Fig. 5B and fig. S5E). Sim- 
ilarly, the overexpression of CYLD but not 
OTULIN in PDFs from a patient with ORAS 
rescued the K63-linked polyubiquitination 
of high-MW caveolin-1 (Fig. 5C and fig. S5F). 
Unexpectedly, a stronger rescue was observed 
when both CYLD and OTULIN were overex- 
pressed (Fig. 5C and fig. S5F). Caveolin-1 co- 
localized with K63-Ub and, to some extent, 
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with M1-Ub aggregates in PDFs (fig. S5G). 
We used a combination of TUBE and high- 
input AQUA-MS/MS in immortalized fibro- 
blasts to identify proteins bound to M1-Ub. 
We detected LUBAC components (fig. S5H), 
which are known to be decorated with M1-Ub 
(29). We also identified caveolin-1 and CYLD 
as possibly binding to M1-Ub (fig. S5H) and 
used cells from a HOILI-deficient patient to 
validate specificity (46). The detection of 
caveolin-1 was not specific (fig. S51), consistent 
with the absence of M1-Ub in the caveolin-1 IP 
(Fig. 5A and fig. S5, C and D). We confirmed 
the specific binding of CYLD to M1-Ub in an 
OTULIN gene dosage-dependent manner 
(Fig. 5D and fig. S5I). The M1-Ub-bound 
CYLD species had a higher MW than that of 
the total CYLD pool (fig. S5I), suggesting post- 
translational modification. We identified 
no Gly-Gly ubiquitination signature sites on 
CYLD by proteomic approaches, suggesting 
that the posttranslational modification is not 
ubiquitination itself and that the binding 
of CYLD to M1-Ub is noncovalent. Unlike 
an OTULIN gene-dosage effect, M1-Ub-bound 
HOIP was detected in cells from ORAS pa- 
tients only (Fig. 5D and fig. S5, I and J). These 
data reveal a direct, LUBAC-independent 
but OTULIN-dependent binding of M1-Ub 
to CYLD. 


Colocalization of caveolin-1 with S. aureus a-toxin 


Caveolin-1 clusters in membrane microdomains 
(52) and has been reported to colocalize with 
a disintegrin and metalloprotease domain- 
containing protein 10 (ADAM10) (54). ADAM10 
is a cell surface receptor used by the staphy- 
lococcal virulence factor a-toxin (54, 55). 
a-Toxin is a pore-forming cytotoxin triggering 
ADAM10-mediated cell death in a process 
facilitated by caveolin-1 (56). We hypothesized 
that the OTULIN-dependent caveolin-1 accu- 
mulation in the PDFs from the patients would 
contribute to the adverse outcomes of their 
disease. We thus investigated the cellular 
kinetics of o-toxin binding. After its addi- 
tion to PDFs, a-toxin colocalized with both 
ADAM)10 and caveolin-1 (Fig. 6A and fig. S6, A 
and B). ADAM10 and caveolin-1 were colocal- 
ized to some extent in the presence or absence 
of toxin binding (Fig. 6A and fig. S6, A and 
B). The patients’ PDFs displayed moderately 
higher levels of o-toxin binding than cells from 
healthy controls (Fig. 6B and fig. S6C). This 
observation could not be explained by differ- 
ential ADAM10 expression at baseline (fig. S6, 
D and E). However, after a-toxin binding, the 
expression of ADAM10 at the cell surface was 
greater in PDFs from patients than in those 
from healthy controls (fig. S6F). These data sug- 
gest a role for caveolin-1 in retaining ADAM10 
at the surface of the cell when OTULIN- 
haploinsufficient fibroblasts are exposed to 
a-toxin (57, 58). 
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Fig. 4. OTULIN-dependent accumulation of caveolin-1 in fibroblasts. 

(A) Transcriptional profile of unstimulated PDFs, as assessed by RNA sequencing, 
expressed as logs fold changes (FC) relative to the mean value for controls. 
Genes satisfying the threshold for statistical significance in comparisons of ORAS 
patients to controls are shown. (B) Proteomic profiles of unstimulated PDFs, 

as assessed by mass spectrometry on WCLs, expressed as Z-scores. Genes 
satisfying the threshold for statistical significance in comparisons of ORAS 
patients to controls are shown. (C) STRING analysis of the cluster of proteins with 
differential abundances in various genotypes identified in (B). (D) Accumulation 
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of SDS-resistant high-MW caveolin-1-containing complexes in PDF-WCLs. 

(E) High-MW caveolin-1-containing complex intensities relative to B-actin 
intensities in PDF-WCLs, normalized against the mean value of healthy controls 
(data points indicate the mean of N = 2 replicates per patient, median per group). 
(F) High-MW caveolin-l-containing complex intensities relative to B-actin 
intensities in PDF-WCLs treated with an sgRNA pool targeting OTULIN (sgOTU), 
normalized against the mean value in those treated with a nontargeting 

control sgRNA (sgNTC) (median, N = 4). The statistical significance of differences 
was assessed in Student's t tests. See also fig. S4. 
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Fig. 5. Polyubiquitination of caveolin-1. (A) Analysis of K63-Ub on caveolin-1 
by LC-MS/MS on purified caveolin-1 from PDF-WCLs. Caveolin-1 was analyzed 
in solution, in the high-MW complex fraction, and in the monomeric (Mono) 
fraction. (B) High-MW K63-Ub complex intensities relative to caveolin-1 
intensities in recombinant deubiquitinase-treated purified caveolin-1 from 
PDFs. (€) High-MW K63-Ub complex intensities relative to caveolin-1 


OTULIN haploinsufficiency impairs intrinsic 
immunity to o-toxin 

We hypothesized that OTULIN haploinsuffi- 
ciency confers a predisposition to o-toxin- 
induced cell death in the PDFs of the patients. 
Indeed, these cells displayed an enhanced sus- 
ceptibility to S. aureus culture supernatant- 
elicited cytotoxicity (Fig. 6C). This cytotoxicity 
was antagonized by prior treatment of the 
supernatant with a-toxin-neutralizing mono- 
clonal antibodies (fig. S6G), indicating a major 
contribution of a-toxin to cytotoxicity in these 
cells. OTULIN genotype-dependent suscep- 
tibility to a-toxin was confirmed with recom- 
binant o-toxin (Fig. 6D and fig. S6H). The 
absence of a particular phenotype after the 
treatment of cells from patients with strepto- 
lysin O (SLO) and aerolysin—microbial toxins 
produced by other bacteria and not requiring 
specific proteinaceous host-cell surface re- 
ceptors (59)—indicates a specific susceptibility 
of the OTULIN-haploinsufficient PDFs from 
patients to o-toxin (Fig. 6D and fig. S6H). 
Normal susceptibility to o-toxin was observed 
in PDFs from a 5p— syndrome patient carrying 
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See also fig. S5. 


two copies of the OTULIN gene, confirming 
the OTULIN-dependence of the phenotype 
(Fig. 6E and fig. S61). o-Toxin elicited higher 
levels of cell surface metalloprotease activa- 
tion in PDFs from the patients than in those of 
healthy controls (Fig. 6F) (55). Again, a 5p— 
syndrome patient carrying two copies of the 
OTULIN gene behaved like controls (Fig. 6F). 
Activity levels were correlated with ADAM10 
levels after o-toxin binding (fig. S6J). Prior 
treatment of the PDFs with an ADAM10- 
selective inhibitor blocked o-toxin-induced 
metalloprotease activation completely in con- 
trols, but activation levels in patients were 
decreased only slightly, to levels seen in un- 
treated controls (Fig. 6F). Moreover, to various 
extents, ADAM10 inhibition protected the 
patients’ PDFs against a-toxin cytotoxicity 
(fig. S6K). Prior treatment with cyclodextrin, 
a chemical product disrupting caveolin-1- 
enriched membrane microdomains (60), pro- 
tected PDFs from the patients against o-toxin- 
induced cytotoxicity (fig. S6K). The susceptibility 
of an OTULIN-deficient PDF cell line was 
partially rescued by knocking out CAV/ but 
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HOIL1-/--e 


intensities in purified caveolin-1 from PDFs after rescue with CYLD and/or 
OTULIN. (D) Intensities of CYLD and HOIP bound to M1-Ub, as detected 

in TUBE pull-downs from immortalized fibroblasts, relative to intensities of 
total CYLD and HOIP, respectively. Statistical significance was calculated 
by ANOVA with Dunnett's post hoc correction for multiple comparisons. 


not by knocking out CAV2 (Fig. 6G and fig. 
S6L), further supporting a facilitating role 
of caveolin-1 in a-toxin cytotoxicity (54). Con- 
sistent with the low levels of caveolin-1 in 
hematopoietic cells from the patients, these 
cells displayed no OTULIN-dependent sus- 
ceptibility to a-toxin (fig. S6M). Murine PDFs, 
unlike their human counterparts, were resist- 
ant to o-toxin-elicited cell death regardless 
of Otulin genotype (fig. S6N). Thus, OTULIN 
haploinsufficiency results in a human-specific 
cell-intrinsic susceptibility of nonhematopoietic 
cells to the major S. awreus virulence factor 
a-toxin. 


o-Toxin—neutralizing antibodies rescue 
OTULIN haploinsufficiency 


Colonization and infection with S. aureus 
elicit o-toxin-neutralizing antibodies in an 
age-dependent manner (6/1). We hypothesized 
that o-toxin-neutralizing antibodies in the 
plasma of adult symptomatic and asymptomatic 
heterozygotes might contribute to the incom- 
plete clinical penetrance of OTULIN haploin- 
sufficiency (Fig. 1, B to D and F) (72). Routine 
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Fig. 6. OTULIN haploinsufficiency impairs intrinsic immunity to o-toxin 
in fibroblasts. (A) Colocalization of a-toxin with ADAMO and caveolin-1 in 


superimposed on that for healthy controls and 5p- patients with a breakpoint 
centromeric to OTULIN from (F) (N = 3, mean + SD). (F) Cell-surface 


PDFs, visualized in high-resolution images acquired by stochastic optical 
reconstruction microscopy, after 30 min of incubation in the presence or 
absence of a-toxin. (B) Binding of a-toxin in PDFs, as detected by flow 
cytometry. (C) Viability of PDFs after 2.5 hours of incubation with culture 
supernatant from S. aureus (data points indicate the mean of N = 3 replicates 
per patient, median per group). (D) Viability of PDFs after 24 hours of 
incubation with recombinant microbial toxins (N = 2 to 4 per group, mean + SD 
per group). (E) Viability of PDFs from a 5p— syndrome patient with a breakpoint 
telomeric to OTULIN after 24 hours of incubation with recombinant o-toxin, 


metalloprotease activity in PDFs 
mutant (H35L) after the treatme 
carrier (data points indicate the 
per group). (G) Viability of OTUL 
targeting CAV1 (sgCAV1), CAV2 ( 
(sgNTC) after exposure to a-toxi 


induced by wild-type a-toxin (WT) or a toxoid 
nt of cells with an ADAM1O inhibitor or 

mean of N = 3 replicates per patient, median 
IN-deficient PDFs treated with an sgRNA pool 
sgCAV2), or a nontargeting control sgRNA 

n (0.12 ug/ml; N = 3, median). The statistical 


significance of differences was assessed in Student's t tests (C), or by 


ANOVA with Bonferroni (F) or Du 
comparisons. See also fig. S6. 


nnett’s (G) post hoc corrections for multiple 


measurements of immunoglobulin (Ig) levels 
and vaccine responses revealed no defects in 
the patients (SM). Plasma o-toxin-neutralizing 
capacity in asymptomatic carriers was sim- 
ilar to that of healthy controls, whereas this 
capacity was significantly lower in patients 
(about one-fifth that in healthy controls) 
(Fig. 7A and fig. S7A). The lower neutralizing 
capacity in the patients was specific for a-toxin, 
because the plasma of the patients neutralized 
SLO and pneumolysin normally (Fig. 7A and 
fig. S7A). In healthy controls, plasma a-toxin- 
neutralizing capacity was largely driven by IgG 
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(fig. S7B). In patients, anti-o-toxin IgG levels 
were significantly lower than those in healthy 
controls, whereas IgA and IgM levels were 
unaffected (Fig. 7B and fig. S7, C and D). The 
anti-o-toxin IgG levels in asymptomatic car- 
riers were like those of healthy controls (Fig. 
7B and fig. S7, C and D) (67). Similar obser- 
vations were made after correction for sample- 
specific total IgG levels (fig. S7E). Thus, 
naturally elicited o-toxin-neutralizing anti- 
bodies can rescue OTULIN haploinsufficiency 
in vivo, thereby contributing to incomplete 
penetrance. 


17 June 2022 


Discussion 

We tested the hypothesis that some patients 
from our cohort with severe staphylococcal 
disease suffered from IEIs. Using an unbiased 
approach, we detected a genome-wide enrich- 
ment in very rare, deleterious heterozygous 
OTULIN variants in the cohort. We identified 
haploinsufficiency for OTULIN as a genetic 
etiology of severe staphylococcal disease. AR 
OTULIN deficiency results in life-threatening 
autoinflammation, manifesting early in life as 
ORAS (29-32). By contrast, patients with AD 
OTULIN deficiency typically presented with 
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Fig. 7. a-Toxin-neutralizing antibodies rescue OTULIN haploinsufficiency. 
(A) Capacity of plasma at a dilution of 1:300 to neutralize the hemolytic 
activity of a-toxin (20 ng/ml), pneumolysin (PLY, 0.67 ng/ml), and streptolysin 
O (SLO, 42 ng/ml) in rabbit erythrocytes. (B) Anti-a-toxin immunoglobulin 


severe disease triggered by S. aureus infec- 
tions. Most of the patients with OTULIN hap- 
loinsufficiency experienced their first episode 
of disease during adolescence. Necrosis was 
a clinical hallmark of disease in the pro- 
bands. Some patients were not diagnosed with 
S. aureus infections, suggesting that low-grade 
infectious or, perhaps, noninfectious triggers 
might be at work. AD OTULIN deficiency is 
clinically expressed in a tissue-specific man- 
ner, because the skin and/or lungs are the 
affected organs in all patients. The pedigrees 
reported in our study displayed variable degrees 
of penetrance and expressivity. Such observa- 
tions are common in situations of haploinsuffi- 
ciency (62, 63). We show that naturally elicited 
a-toxin-neutralizing antibodies can rescue 
OTULIN haploinsufficiency, and we suspect 
that declining levels of such antibodies may 
contribute to recurrent episodes of severe 
staphylococcal disease in some patients. We 
estimate the clinical penetrance of AD OTULIN 
deficiency at ~30%. This estimate of clinical 
penetrance mirrors our findings for individ- 
uals with 5p- syndrome. It is difficult to estab- 
lish causal relationships between individual 
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genes and specific phenotypes in patients with 
chromosomal deletion syndromes. Clinical 
manifestations of a monogenic disorder ina 
chromosomal deletion syndrome have also 
been reported for GATA3 haploinsufficiency 
(64). Our observations suggest that the study 
of rare IEIs can help to clarify phenotypes 
seen in individuals with more-common chro- 
mosomal abnormalities. 

In ORAS patients, autoinflammation results 
from the defective down-regulation of NF-«B- 
dependent inflammatory signaling in hemato- 
poietic cells, particularly those of the myeloid 
lineage (29-32). OTULIN deficiency causes a 
gene dosage-dependent accumulation of M1- 
Ub, but OTULIN haploinsufficiency is both 
clinically and biochemically silent for TNF- 
induced signaling events, instead causing 
a predisposition to infection. The apparent 
paradox of autoinflammation and immuno- 
deficiency within the spectrum of clinical phe- 
notypes of OTULIN deficiency is also observed 
in patients with HOIL1 and HOIP deficiencies, 
which greatly decrease the levels of M1-Ub 
(46, 65). The combination of autoinflamma- 
tion and immunodeficiency in HOILI1 and 
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levels in plasma at a dilution of 1:750. We analyzed plasma from adult 
patients, relatives, and controls. The statistical significance of differences was 
determined by ANOVA with Dunnett's post hoc correction for multiple 
comparisons. See also fig. S7. 


HOIP deficiencies highlights the complex role 
of LUBAC in maintaining the cell type-specific 
balance between inflammation and immu- 
nity. Inflammation is regulated by two other 
deubiquitinases in addition to OTULIN: A20 
and CYLD. Phosphorylated A20 preferentially 
cleaves K63-Ub in cells (66, 67). CYLD hydro- 
lyzes K63-Ub and M1-Ub (68). AD A20 defi- 
ciency causes an autoinflammatory condition 
resembling Behcet’s disease (69), whereas AD 
CYLD deficiency results in cylindromatosis of 
the skin (70). AR HOIL1, HOIP, and OTULIN 
deficiencies and AD A20 deficiency are sys- 
temic disorders. Conversely, AD OTULIN and 
CYLD deficiencies result in tissue-specific dis- 
ease. OTULIN and CYLD bind HOIP in a mu- 
tually exclusive manner (77). The detection 
of CYLD bound to M1-Ub reveals another, 
LUBAC-independent, layer of cross-talk be- 
tween the two deubiquitinases. CYLD activ- 
ity and specificity were recently shown to be 
regulated by phosphorylation and ubiquitin- 
binding CAP-Gly domains (72). We speculate 
that, with insufficient amounts of OTULIN, 
phosphorylated CYLD primed for activity 
toward K63-Ub is quenched by M1-Ub. The 
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accumulation of caveolin-1 in dermal fibroblasts 
observed in patients with OTULIN deficiency 
probably results from disruption of the poly- 
ubiquitination of this molecule. 

Human OTULIN haploinsufficiency is an 
inborn error of cell-intrinsic immunity to 
S. aureus infection. Staphylococcal disease 
displays a remarkable organ tropism, affect- 
ing the skin and lungs in particular (7). Tissue 
barrier disruption is a hallmark of severe 
staphylococcal disease (73). o-Toxin is a key 
staphylococcal virulence factor that contrib- 
utes to severe disease (73-76) and can injure 
nonhematopoietic cells (54, 73, 77, 78). The 
mechanism underlying the impairment of 
staphylococcal immunity in OTULIN haploin- 
sufficiency is reminiscent of that involved in 
cell-intrinsic immunodeficiencies conferring 
susceptibility to viral infections (79-81). In 
the absence of sufficient levels of a-toxin- 
neutralizing antibodies, susceptibility to staphy- 
lococcal disease in patients with OTULIN 
haploinsufficiency is driven by a defect of 
cell-intrinsic immunity to the o-toxin of the 
pathogen. In patients with OTULIN haploin- 
sufficiency, caveolin-1 accumulates specifically 
in nonhematopoietic cells. Caveolin-1 clusters 
with a-toxin, and the retention of ADAM10 
at the surface of the patients’ cells predis- 
poses these cells to o-toxin-induced cell death 
(82, 83). Caveolin-1 accumulation is observed 
in the cells of patients with either biallelic or 
monoallelic OTULIN deficiencies. The late- 
onset infectious phenotype in patients with 
AD OTULIN deficiency may be overshadowed 
by the early-onset severe autoinflammatory 
manifestations in ORAS or compensated for 
by the presence of a-toxin-neutralizing anti- 
bodies. Consistent with this view, we also 
identified a pediatric homozygous carrier of 
a moderately hypomorphic OTULIN variant 
with an infectious phenotype indistinguish- 
able from that of our haploinsufficient pro- 
bands, with no autoinflammatory features. 
This work provides a direct demonstration of 
a defined host characteristic predisposing hu- 
mans to adverse events elicited by a specific 
bacterial virulence factor. 

S. aureus is a pathogen of paramount sig- 
nificance in human health (J). In the face of a 
globally emerging epidemic of community- 
acquired methicillin-resistant S. aureus (MRSA) 
strains (J8), innovative strategies for treating 
and preventing infection are needed (84). In the 
presence or absence of documented S. aureus 
infections, OTULIN-haploinsufficient patients 
with severe necrosis may benefit from com- 
bined antibiotic and steroid therapy, together 
with o-toxin-neutralizing monoclonal anti- 
bodies (85). Despite clear public health needs, 
no effective S. awreus vaccine has yet been 
approved (86). Humans are the main reservoir 
of S. aureus, and the pathogen is highly adapted 
to its human host (77, 78). Mice are naturally 
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resistant to S. aureus infection (87). The prin- 
cipal mechanistic differences in hematopoietic 
immunity to staphylococcal infections between 
humans and mice have been characterized 
(77, 78). By contrast, the mechanisms of cell- 
intrinsic immunity in nonhematopoietic cells 
have been minimally studied in humans or 
mice (80, 88-90). Staphylococcal virulence fac- 
tors, such as a-toxin, are considered potential 
targets for strategies aiming to treat and pre- 
vent infection (77, 78). For many of these 
virulence factors, and cytotoxins in particular, 
one or more aspects of the interaction with 
the host are human-specific (78, 91). Murine 
ADAM10 can interact with a-toxin (55). How- 
ever, OTULIN haploinsufficiency reveals a layer 
of human specificity at the interface between 
a-toxin and its human nonhematopoietic tar- 
get cells. OTULIN haploinsufficiency demon- 
strates the contribution of S. awrews toxins 
to the pathophysiology of severe disease in 
humans. The experiment of nature described 
here identifies a mechanism of human cell- 
intrinsic immunity to S. aureus infections and 
highlights the potential for interfering with the 
staphylococcal o-toxin for treating and pre- 
venting disease (85, 92). 


Materials and methods summary 
Human subjects 


Informed consent was obtained from each 
patient, in accordance with local regulations, 
and a protocol for research on human subjects 
was approved by the institutional review 
boards of Institut National de la Santé et de 
la Recherche Médicale (INSERM, protocol 
C10-16) and The Rockefeller University (pro- 
tocols JCA-0698 and JCA-0695). 


Whole-exome sequencing, breakpoint 
determination, and variant enrichment analyses 


Genomic DNA was extracted from whole blood. 
Whole-exome sequencing and breakpoint 
determinations are detailed in the SM. A gene- 
based enrichment analysis was performed on 
the cohort of 105 patients with severe staphy- 
lococcal disease, together with 1274 individuals 
with Mendelian susceptibility to mycobacte- 
rial disease or suffering from tuberculosis as 
controls. Rare variants [MAF < 1 x 107° in the 
Genome Aggregation Database (gnomAD) (20)] 
predicted to be damaging [CADD > MSC (2)] 
were retained (table S1) (23). The proportions 
of individuals with rare damaging variants in 
the cohort and the controls were compared by 
logistic regression with likelihood ratio tests 
(table S2). The first five principal components 
of the PCA were systematically included in the 
logistic regression model to account for the 
ethnic heterogeneity of the cohorts, as previ- 
ously described (93). Genes were then tested 
under a dominant genetic model. Four carriers, 
three of whom carried heterozygous variants, 
were identified among the cases. No carriers 
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were identified among the controls. We then 
performed the same analysis again, but with 
the addition of 2504 individuals from 1000 
Genomes Project phase 3 (25) to our control 
dataset (26). 


Cell culture 


PDFs were obtained from skin biopsy speci- 
mens and cultured in Dulbecco’s modified 
Eagle’s medium (DMEM) (Gibco) supple- 
mented with 10% heat-inactivated fetal bo- 
vine serum (HI-FBS) (Gibco) unless otherwise 
specified. PBMCs were isolated from whole 
blood by density gradient centrifugation 
on Ficoll-Paque PLUS (GE Healthcare Life 
Sciences) and maintained in Roswell Park 
Memorial Institute (RPMI) 1640 medium 
(Gibco) supplemented with 10% HI-FBS un- 
less otherwise specified. Immortalized fibro- 
blasts were generated by transformation with 
SV40 as previously described (94) and main- 
tained in DMEM supplemented with 10% HI- 
FBS. HEK293T cells (American Type Culture 
Collection) were cultured in DMEM supple- 
mented with 10% HI-FBS. 


Generation of OTULIN constructs, transfection, 
transduction, and nucleofection 


The human canonical OTULIN cDNA open 
reading frame clone was amplified from 
pCMV6-Entry FAM105B (OriGene) and in- 
serted into the pCMV6-AN-Myc-DDK-tagged 
vector (OriGene) or pLentiIII-UBC (Abmgood). 
Site-directed mutagenesis to generate the 
OTULIN variants present in the patients was 
performed by a modified overlap-extension 
polymerase chain reaction (PCR)-based meth- 
od (95). All constructs were validated by Sanger 
sequencing. HEK293T cells were transiently 
transfected in the presence of Lipofectamine 
LTX (Thermo Fisher Scientific). For lentivi- 
ral transductions, semi-confluent HEK293T 
cells were cotransfected with OTULIN, GAG, 
POL, and ENV plasmids. The supernatant was 
collected at 48 and 72 hours and concentrated 
with a Lenti-X concentrator (Clontech). A 
10-fold dilution (by volume) of the lentivirus 
preparation was added to semi-confluent 
immortalized fibroblasts plated on six-well 
plates. Transduced cells were selected with 
puromycin (Invitrogen). PDFs were mixed 
with gene knockout kit V2 single guide RNA 
pools (Synthego) in the presence of TrueCut 
Cas9 protein V2 (Thermo Fisher Scientific), 
or with pCMV6 expression vectors, and nu- 
cleofected with the basic nucleofector kit for 
primary mammalian fibroblasts (Lonza) on a 
Nucleofector 2b device (Lonza). 


NF-xB inhibition assay and linear 
deubiquitinase assay 


HEK298T cells in 96-well plates were tran- 
siently cotransfected with pCMV6 vectors con- 
taining the OTULIN cDNA variants, the NF-«B 
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firefly luciferase plasmid pGL4.32, and the 
Renilla luciferase plasmid (pRLTK) as an 
internal control. The cells were stimulated, 
24 hours after transfection, by incubation with 
20 ng/ml TNF in minimal medium (1% FBS) 
for an additional 24 hours. The ratio of firefly- 
to-Renilla luciferase expression was assessed 
with the Dual-Glo Luciferase assay (Promega) 
on a bioluminescence plate reader. Data were 
normalized against the inhibitory activity of 
cells transfected with a vector encoding the 
OTULIN reference cDNA. For linear deubi- 
quitination assays, HEK293T cells were co- 
transfected with pCMV6 vectors containing 
the OTULIN cDNA variants, RBCK1, RNF31, 
SHARPIN, and linear ubiquitin (UBC) and 
harvested after 48 hours. Whole-cell lysates 
were prepared with WCL buffer A (table S4). 
For immunoprecipitation, anti-M1-Ub antibody 
was added, and the lysates were incubated 
overnight at room temperature. Dynabeads 
Protein G (Thermo Fisher Scientific) beads 
were added to the samples, which were then 
incubated for 1 hour at room temperature. The 
beads were then washed with WCL buffer A 
(table S4) and resuspended in 2 x NuPAGE 
LDS sample buffer (Thermo Fisher Scientific) 
supplemented with dithiothreitol (DTT). The 
antibodies used are described in table S3. 


Reverse transcription and PCR 


Total RNA was extracted from transiently 
transfected HEK293T cells or from PDFs with 
the RNeasy Mini Kit (QIAGEN). Reverse tran- 
scription was performed with the SuperScript 
III First-Strand Synthesis System (Invitrogen) 
and random hexamers. Quantitative PCR was 
performed with the TaqMan Universal PCR 
Master Mix (Applied Biosystems) in the 7500 
Fast Real-Time PCR System (Applied Biosys- 
tems). The following TaqMan Gene Expression 
assays (Thermo Fisher Scientific) were used: 
OTULIN (Hs01113237_m1), GUSB (Hs00939627_ 
m1), and GAPDH (Hs99999905_m1). 


Whole-cell lysates, SDS-PAGE, and Western 
blotting 


Transiently transfected HEK293T cells were 
harvested and lysed in WCL buffer B (table 
S4). PDFs were synchronized by incubation 
in DMEM containing 1% FBS for 24 hours 
before trypsin treatment and were then lysed 
in WCL buffer C (table S4) with sonication to 
clear the supernatants. Immortalized fibro- 
blasts were stimulated with 10 ng/ml TNF and 
lysed in WCL buffer D (table S4). Monocytes 
were obtained from freshly thawed PBMCs 
with the Pan Monocyte Isolation Kit (Miltenyi 
Biotec) and lysed in WCL buffer C (table S4) 
with sonication to clear the supernatants. 
Laemmli buffer (BioRad) supplemented with 
DTT was added to the clarified lysates. Pro- 
teins were separated by SDS-polyacrylamide 
gel electrophoresis (SDS-PAGE) transferred 
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onto an Immobilon-P PVDF membrane (Milli- 
pore). The membrane was blocked by incuba- 
tion in PBS supplemented with 0.1% Tween 20 
and 5% bovine serum albumin (BSA) and in- 
cubated overnight with the primary anti- 
body, followed by the appropriate horseradish 
peroxidase-conjugated secondary antibody. 
Immunoreactive proteins were visualized by 
enhanced chemiluminescence. Intensity analy- 
ses were performed with ImageStudioLite 
(LI-COR). The antibodies used are described 
in table S3. 


Cytometry by time-of-flight 


Freshly thawed PBMCs were incubated with 
Fe block and then with a panel of metal- 
conjugated antibodies, as described elsewhere 
(96). The antibodies used are described in 
table S3. Batch-integrated unsupervised cluster- 
ing analysis was conducted with the iIMUBAC 
pipeline (45). The analysis included 29 adult 
controls (nine studied in replicate) and nine 
pediatric controls (all under the age of 18 years) 
studied across seven batches of experiments, 
four patients with heterozygous mutations 
of OTULIN and two 5p- patients studied in 
two batches of experiments. Clusters were 
identified manually and renamed according 
to their marker expression patterns. 


Flow cytometry 


Freshly thawed PBMCs were rested for 3 hours 
and were then stimulated for 90 min with 
20 ng/ml TNF (R&D Systems) or 1 ng/ml LPS 
(InvivoGen). The reaction was stopped by 
washing the cells with PBS, and the cells were 
then subjected to live/dead staining. Cells 
were subsequently stained with surface mark- 
ers and washed. They were then permeabi- 
lized and stained with Phosflow Perm Buffer 
(BD) or Intracellular Staining Perm Wash 
Buffer (BioLegend). PDFs were synchronized 
by incubation in DMEM supplemented with 
1% HI-FBS for 24 hours and were then har- 
vested by trypsin treatment. The fibroblasts 
were stained, fixed, and acquired on an LSR- 
Fortessa cytometer (BD). All flow cytometry 
results were analyzed with FlowJo (version 10). 
The antibodies used are described in table S3. 


Cytokine detection 


PDFs were synchronized by incubation in 
DMEM supplemented with 1% HI-FBS for 
24 hours before stimulation. At steady state 
and after 6 hours of stimulation with 20 ng/ml 
TNF (R&D Systems), supernatants were har- 
vested, and the levels of IL-6 and IL-8 were 
assessed with the IL-6 and IL-8 Human enzyme- 
linked immunosorbent assay (ELISA) kits 
(Thermo Fisher Scientific). Freshly thawed 
PBMCs were left unstimulated or stimulated 
with 20 ng/ml TNF (R&D Systems), 10 ng/ml 
IL-1B (R&D Systems), HKSA (heat-killed S. 
aureus; MOI: 10; InvivoGen), or 1 ng/ml LPS 
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(InvivoGen) for 6 hours. Supernatants were 
harvested, and chemokines and cytokines 
were detected with LEGENDPlex Human In- 
flammation Panel 1 (BioLegend). 


Induced pluripotent stem cells 


The generation and culture of iPSCs, their 
differentiation into macrophages, and the 
phagocytosis and H,O, production assays are 
described in the SM. 


Transcriptional and proteomic analyses 


Transcriptional analyses in PDFs and PBMCs, 
single-cell RNA-sequencing on PBMCs, and 
global proteomics analyses of PDFs are de- 
scribed in the SM. 


Caveolin-1 immune purification from 
whole-cell lysates 


PDFs were synchronized by incubation in 
DMEM containing 1% HI-FBS for 24 hours 
and were then harvested by trypsin treat- 
ment. Cell pellets were lysed in WCL buffer C 
(table $4) with sonication to clear the super- 
natants. Cleared WCLs were incubated over- 
night at 4°C with Protein A Dynabeads (Thermo 
Fisher Scientific) conjugated to the monoclo- 
nal anti-caveolin-1 antibody or an isotype con- 
trol. The samples were then eluted from the 
beads with 50 mM glycine, pH 2. The anti- 
bodies used are described in table S3. The IPs 
were then split into portions for loading onto a 
gel or direct in-solution proteomic analyses, as 
described in the SM. 


Deubiquitination of purified caveolin-1 


Caveolin-1 IPs were treated with recombinant 
deubiquitinases as described elsewhere (97). 
Briefly, immunopurified caveolin-1 was incu- 
bated with deubiquitinases at 37°C for 30 min. 
Recombinant USP2, AMSH, and OTULIN 
(UbiCREST Deubiquitinase Enzyme Set, R&D 
Systems) were used at a final concentration of 
1x. Recombinant His-CYLD (R&D Systems) 
was used at a final concentration of 1 uM. 
Purified polyubiquitin chains were deubiqui- 
tinated by incubating tetra-M1-, K48-, and 
K63-Ub chains (R&D Systems) at a concentra- 
tion of 3 uM with deubiquitinases. After the 
enzymatic reactions, sample buffer containing 
DTT was added, and the proteins were sep- 
arated by SDS-PAGE, as described above. 


Linear ubiquitin-selective tandem Ub-binding 
entity assay 


For M1-Ub pull-down, immortalized fibro- 
blasts were lysed in WCL buffer E (table S4) 
containing FLAG-tagged M1-Ub-selective 
TUBE (LifeSensors). Supernatants were cleared 
by centrifugation and incubated with Anti- 
FLAG M2 agarose beads (Sigma Aldrich) for 
2 hours at 4°C. The beads were washed with 
WCL buffer E (table S4) and resuspended in 
1x Laemmli buffer without reducing agent. 
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For M1-Ub quantification, we used an input 
of ~0.5 mg per sample. For the identification 
of peptides bound to M1-Ub, we used an in- 
put of ~2 mg per sample. M1-Ub quantifica- 
tion and the identification of peptides bound 
to M1-Ub by means of AQUA-MS/MS are 
described in the SM. For Western blot analyses, 
beads were resuspended in sample buffer con- 
taining DTT, and samples were separated by 
SDS-PAGE, as described above. 


Immunofluorescence assays 


Immortalized fibroblasts were cultured on 
glass coverslips for the staining of M1-Ub, 
K63-Ub, and caveolin-1. The cells were fixed in 
methanol, blocked by incubation with 1% BSA, 
and incubated overnight with primary anti- 
bodies and CellMask deep red plasma mem- 
brane stain (Thermo Fisher Scientific), followed 
by secondary antibodies and 4’,6-diamidino-2- 
phenylindole (DAPI). Specimens were exam- 
ined either with an Imager Z1 microscope 
(Carl Zeiss, for M1-Ub aggregate number 
quantifications) or a LSM800 laser scanning 
microscope (Carl Zeiss, for caveolin-1 and M1- 
Ub or K63-Ub colocalization experiments). 
The numbers of ubiquitin aggregates per cell 
were quantified using CellProfiler (https:// 
cellprofiler.org/). Nuclei and cell boundaries 
were segmented to identify OTULIN-positive 
cells and to attribute M1-Ub-positive aggre- 
gates to parental cells. Immunofluorescence 
deubiquitination and competition assays are 
described in the SM. For o-toxin colocaliza- 
tion and binding experiments, PDFs were 
synchronized by incubation in DMEM con- 
taining 1% HI-FBS for 24 hours before trypsin 
treatment. PDFs were incubated with 10 ug/ml 
a-toxin-Cys-AF647, washed, and then fixed 
with methanol. After incubation with anti- 
bodies and DAPI, cells were seeded in a slide 
chamber. Images were acquired on a Leica SP5 
confocal microscope (Leica Microsystems) and 
NanoImager (ONI). In parallel, the cells were 
recorded on a FACS Verse flow cytometer 
(BD). Confocal images were analyzed with 
Image J Fiji software and the JACoP colocal- 
ization analysis plugin (98). The antibodies 
used are described in table S3. 


Cell viability assays 


PDFs were plated in a microtiter plate and 
synchronized in DMEM containing 1% HI- 
FBS and 25 mM DMEM for 24 hours and 
were then incubated for 2.5 hours with 5 or 
10% crude bacterial culture supernatant or 
for 24 hours with recombinant toxins, or with 
100 ng/ml TNF (R&D Systems) plus 1 uM BV6 
(ApexBio) at 37°C, under an atmosphere con- 
taining 5% COs. Bacterial supernatants were 
neutralized by incubation for 15 min in DMEM 
supplemented with 10% HI-FBS and 100 ug/ml 
monoclonal antibodies (92, 99). PBMCs and 
iPSC-derived macrophages were incubated 


Spaan et al., Science 376, eabm6380 (2022) 


with a-toxin for 2 and 24 hours, respectively, 
at 37°C, under an atmosphere containing 5% 
COs. For the inhibition of a-toxin cytotoxicity, 
PDFs were treated for 1 hour with 20 uM GI- 
254023X (R&D Systems) or dimethyl sulfoxide 
(DMSO) before adding 1.1 1.g/ml a-toxin, or for 
2 hours with 1 mM methyl-f-cyclodextrin 
(Sigma) followed by washing and the addition 
of 1.1 ug/ml a-toxin. Viability was assessed 
with the CellTiterGlo Luminescent Cell Via- 
bility Assay (Promega) and expressed rela- 
tive to cells not incubated with the toxin or 
supernatant. Four-parameter nonlinear regres- 
sion analyses were performed to obtain half- 
maximum effective concentrations (ECs). 
The expression of recombinant toxins and 
the generation of bacterial culture super- 
natant are described in the SM. 


Metalloprotease activity assays 


PDFs were plated in a microtiter plate and 
synchronized by incubation in DMEM con- 
taining 1% HI-FBS and 25 mM DMEM for 
24 hours. They were then treated for 1 hour 
with 20 uM GI-254023X (R&D Systems) or 
DMSO before the addition of 3.3 ug/ml wild- 
type or H35L-toxoid o-toxin. After 2 hours 
of incubation at 37°C, under an atmosphere 
containing 5% COs, cells were washed with 
25 mM Tris (pH 8) and 10 uM Mca-PLAQAV- 
Dpa-RSSSR-NH2 Fluorogenic Peptide Substrate 
(R&D Systems) was added. After incubation for 
30 min, fluorescence intensity was measured on 
a monochromator-based microplate reader and 
expressed relative to cells not incubated with 
the toxin. 


Mice 
The generation and use of Otulin-haploinsufficient 
mice are described in the SM. 


Anti-o-toxin immunoglobulins 


The capacity of the plasma to neutralize mi- 
crobial toxins was assessed with rabbit eryth- 
rocytes, as described in the SM. Pooled plasma 
was depleted of IgG with HiTrap Protein G 
HRP columns (GE Healthcare). Anti-a-toxin 
immunoglobulin levels were measured by 
ELISA, as described in the SM. 
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ROBOTIC MATERIALS 


Design and printing of proprioceptive 
three-dimensional architected robotic metamaterials 


Huachen Cui, Desheng Yao’, Ryan Hensleigh’, Haotian Lu2, Ariel Calderon’, Zhenpeng Xu?, 


Sheyda Davaria*, Zhen Wang", Patrick Mercier’, Pablo Tarazaga°, Xiaoyu (Rayne) Zheng 
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Advances in additive manufacturing techniques have enabled the creation of stimuli-responsive 
materials with designed three-dimensional (3D) architectures. Unlike biological systems in which 
functions such as sensing, actuation, and control are closely integrated, few architected materials have 
comparable system complexity. We report a design and manufacturing route to create a class of 
robotic metamaterials capable of motion with multiple degrees of freedom, amplification of strain in a 
prescribed direction in response to an electric field (and vice versa), and thus, programmed motions with 
self-sensing and feedback control. These robotic metamaterials consist of networks of piezoelectric, 
conductive, and structural elements interwoven into a designed 3D lattice. The resulting architected 
materials function as proprioceptive microrobots that actively sense and move. 


dvances in additive manufacturing tech- 

niques have enabled the fabrication of 

materials with ultrahigh stiffness (7) and 

damage tolerance (2, 3), exotic mechani- 

cal behaviors (4, 5), negative thermal ex- 
pansion coefficients (6), fluid control (7), and 
wave transmissions (8). These materials can be 
digitally designed by placing structural elements 
as bits and atoms in a three-dimensional (3D) 
layout. The potential of architecting electronic 
and multifunctional materials and their com- 
binations is still elusive, as most additive fab- 
rication techniques have focused on single 
structural materials such as polymers, metals, 
and ceramics. Architecting functional mate- 
rials could offer exotic and configurable prop- 
erties in functional space, such as directional 
sensing (9), morphing (JO), and reconfigur- 
ability (11, 12). 

Piezoelectric materials, which are capable of 
converting an electrical field into mechanical 
strain and vice versa (73), are ideal candidates 
for providing both sensing and actuation func- 
tions in a robotic system. These materials are 
widely employed in precision actuators (J4), 
manipulators (75), accelerometers (6), and 
tactile sensors (17, 18) to generate robotic mo- 
tions and sensing feedback. However, their 
electric field-induced strain, as a result of 
asymmetric displacement of ions in crystal 
dimensions, is derived from and thus limited 
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by the naturally existing crystal structures, 
leading to a magnitude of <0.5% [i.e., the 
strain range in lead zirconate titanate (PZT) 
ceramic and single-crystalline lead magne- 
sium niobate-lead titanate (PMN-PT) is <0.1 
and <0.5%, respectively] in only the normal 
and shear directions (13). To use piezoelectric 
materials as transducers in robotic systems, 
the manufacturing process must involve ex- 
tensive processing and assembly steps—such 
as ceramic machining (19), lamination (20), 
attaching planar electrodes for both activa- 
tion and driving (21), and integration with 
transmission mechanisms (22)—to amplify 
the piezoelectric strain and transform the 
strain into motion in desired directions. These 
manufacturing routes usually process solid 
piezoelectric materials only and cannot pre- 
cisely pattern electrodes, making it difficult 
to reduce the weight of an actuation element 
and activate bidirectional piezoelectric effects 
at small scales (23). 

We show that a two-way conversion from an 
electric field to an arbitrary mechanical strain 
in a prescribed direction (and vice versa) can 
be achieved via a 3D network of interconnected 
piezoactive, conductive, and structural phases 
(Fig. 1A). Through analytical, numerical, and 
experimental validations, we show that these 
3D microarchitected piezoelectric materials 
exhibit piezoelectric strain constants that can- 
not be achieved in standard materials and 
result in a wealth of electric field-induced 
strain conversions—including tension, shear, 
twisting, and flexure degrees of freedom (DoFs) 
and their combinations—as well as amplifi- 
cation of strain and logic-related subtraction 
and addition of strains. We developed a charge- 
programmed multimaterial additive manufac- 
turing technique capable of directly assembling 
conductive, piezoactive, and structural mate- 


rials into a complex 3D microarchitecture, 
which leads to a class of robotic metamate- 
rials. We use the term “robotic metamate- 
rials” to represent materials with designed 
3D microarchitectures that directly serve 
as microrobots capable of performing many 
robotic tasks, including locomotion, steering, 
stepping, and two-way sound and ultrasonic 
transductions, as well as decision-making via 
feedback control (24). 


Rational design of robotic metamaterials with 
arbitrary strain modes 


The core concept of architected materials is 
the unrestricted placement of materials in a 3D 
cellular topology that either bypasses limita- 
tions inherent in natural crystals or mimics 
them to achieve desired properties. We intro- 
duce a convenient and robust strategy to ar- 
chitect piezoactive, conductive, and structural 
phases (Fig. 1A) in 3D space. Such multimaterial 
metamaterials are capable of receiving an 
input electric field and outputting a desired 
mode of strain that includes DoFs beyond the 
Cauchy strain components (normal and shear 
strains) (25), such as normal, shear, twisting, 
and flexure modes as well as their combina- 
tions and amplifications. 

Because the existing piezoelectric tensors 
are not sufficient to describe all DoFs, we 
define the generalized piezoelectric tensors 
dnm and dnm on the basis of Cosserat solids 
(26) to describe the strain conversions of the 
architected piezoelectric materials (Fig. 1B) 


Em = dnmEn; Qn = Onin Er (1) 


where E,, is the electric field along the n di- 
rection in Cartesian coordinates (7 = 1 to 3); 
€m and @,,, (m = 1 to 6) are the directional strain 
and coupled strain tensors (25), respectively; 
dnm is the piezoelectric tensor that describes 
only the correlation between the normal or 
shear strain and the electric field; and dnm is 
the extended piezoelectric tensor with com- 
ponents that denote the additional strain 
modes, including twist and flexure. 

To design a piezoelectric microarchitecture 
that displays a desired global strain mode dy», 
(or dnm) (Fig. 1B; see movie S1 for 36 distin- 
guished strain modes), we start by identifying 
the motion of a stack of virtual characteristic 
planes within a unit cell of the microarchi- 
tecture (Fig. 1, C and D) and the local strain of 
the piezoactive struts that make up the unit 
cell (Fig. IE and movie S1). The virtual char- 
acteristic planes can be considered “pinned” 
by the piezoactive struts to allow unconstrained 
motion and display a motion representing the 
desired global strain. As shown in fig. S1, A 
to D, the characteristic planes undergo dis- 
tance changes, slip, rotation, and tilt corre- 
sponding to strain modes such as normal strain 
(Em, With m = 1 to 3), shear (€;,, with m = 4: to 6), 


lof7 


RESEARCH | RESEARCH ARTICLE 


A Micro-architected Piezo-active phase B; 
robotic metamaterial M 


Conductive 
phase 
‘ 


dim 


‘ 
dz, da 
d3, d32 

C__ Select strain mode (i. e. d33) 

Driving Electrode 

voltage — —, 


Architected material with Tessellate unit cell based on Integrate structural and 


prescribed strain mode structural symmetry 


J Amplified expansion (d33) 
Mode selection 


Localized Piezo-strain 
electrodes / | 


i L 


Unit cell design 7 


Tessellation 


i ae 
Structural phase dia dig Ga Gia tis Gee 
do3 do4 dos dy6 11 doa dz2 d33 do4 das do6 
d33 d3q4 dgs d3o!d3; d3z d33 d3q d35 dye 


D Motion of characteristic planes E 


ompound model 
i 
t 


a 
"diy dia dis dia dis dig 


Strain of piezo-struts 


Localized 
: electrodes 
i \} \ 
: External 

: eldctrodes 

i /t 


Polarization e 


Electric field 


Identify global and local strain 


Construct unit cell 
Plane rotation 


G 


Place piezo-active strut groups 

conductive phase based on plane motion 
K Compound shear and expansion with 

amplification (d33 & d35) 

eo Group (1 

Mode selection Ss 


Shear 


Group (2 \ 
&expansion 


Individually activated ¢ 
Oma = 
Shear 
Inverse electric field 
Unit cell design @ => | | “Tessellation 
<+ _ 


Fig. 1. Rational designs of robotic metamaterials with arbitrary strain modes. (A) Schematic of the 
piezoelectric metamaterial consisting of piezoactive, conductive, and structural phases. (B) Schematic of various 
strain modes and a piezoelectric strain matrix with extended tensors (red), enabled by metamaterial design 
and compared with the 18 tensors in natural piezoelectric ceramics (blue). (C to 1) Schematic of the design 
rationale for the piezoelectric metamaterials. The example shown here is for the d33 (twisting) mode. (J) Designs 
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twist (¢,,, With m = 1 to 3), and flexure (@,,, 
with m = 4 to 6). Figure 1D shows an example 
of a cube that twists upon application of an 
electric field in the direction of the 3-axis (d33 
mode), which is represented by the in-plane 
rotation of the characteristic planes (Fig. 1D). 
The local strain of a piezoactive strut within 
the unit cell, either expanding or contracting, 
is determined by the direction of the strut, 
polarization, and electric field (see materials 
and methods for details on local strain iden- 
tification). The electric field is generated by 
a conducting phase that either covers the 
sides of the lattice topology [external electrodes 
(Fig. 1E, ())] or penetrates the topology [lo- 
calized electrodes (Fig. 1E, @))], covering both 
sides of the active struts and resulting ina 
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shorter distance between electrodes and a 
stronger electric field. 

The next design step is to place the piezo- 
active struts in a spatial layout such that their 
local strain drives the virtual characteristic 
planes to display the motion path correspond- 
ing to the target global strain mode (dpm or 
dnm) (movie S1). Figure IF presents the design 
of the piezoactive struts that generate a clock- 
wise rotation of the characteristic planes, 
resulting in a twist strain mode and corre- 
sponding to a nonzero piezoelectric twist co- 
efficient, d33. The 3D microarchitectural layouts 
of piezoactive struts for other strain modes 
(including expansion, shear, and flexure) are 
summarized in fig. S1, E to H. The full unit cell 
design is then completed by adding a struc- 
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tural phase and a conductive phase (Fig. 1G) 
in a layout that matches the symmetry of the 
piezoactive struts (see materials and methods 
for details on different phases). 

In the last step, we tessellate the unit cells 
into a “metacrystal” that reflects the effective 
response of the unit cell (see materials and 
methods and fig. S1, I to L, for the choice of 
tessellation orientations). For example, to gen- 
erate a twisting metamaterial, we tessellate 
the unit cell in cylindrical coordinates along 
the radial (r), angular (0), and height (z) di- 
rections (Fig. 1, H and I, and movie S1). This 
aperiodic tessellation is scalable to an infinite 
number of unit cells and bypasses the effect of 
the number of unit cells when tessellated in 
Cartesian coordinates, for which the twist 
strain would vanish when reduced to a Cauchy 
continuum as the unit cell number increases 
(27) (fig. S2). As such, these metamaterial 
concepts are scalable, and the coefficient is 
invariant of the number of unit cells in all 
directions. Designs with all strain modes 
shown in the extended matrix are summarized 
in table S1. 

The interpenetrating piezoactive, structural, 
and conductive phases allow amplification 
and logic-related subtraction and addition of 
strains, leading to the coupling and suppres- 
sion of selected strain modes. Figure 1J shows 
the unit cell design featuring amplified expan- 
sion by tessellating pairs of piezoactive and 
structural struts (28). Figure 1K demonstrates 
the design with localized electrodes. The local- 
ized electrode architecture that covers selected 
groups of struts allows for programming of the 
polarization and electric field direction within 
the unit cell (Fig. 1K; see fig. S3 for polariza- 
tion and driving voltage programming), thereby 
achieving a compound strain mode with both 
shear and expansion strain output or a de- 
coupled strain mode with doubled shear strain 
and suppressed expansion. 

We implemented a computational frame- 
work that predicts and verifies the piezoelectric 
coefficients—dym and dnm, respectively—of 
the metacrystal designs. This is achieved by 
calculating the total induced-load contribu- 
tion of the connected piezoactive struts within 
the cubic volume under an electric field [ma- 
terials and methods and supplementary text 
section 1 (SM 1)]. SM 2 and fig. S4 show the 
finite element analysis results that verify the 
actuation modes. 


Additive manufacturing of robotic 
metamaterials 


We developed a charge-programmed multi- 
material additive manufacturing technique 
capable of assembling piezoactive, structural, 
and conducting phases into a complex 3D 
microarchitecture. First, a negatively charged 
resin [which can be selectively deposited with 
metals (29)] and a highly loaded nanoparticle 
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Fig. 2. Multimaterial fabrication platform. (A and B) Schematic illustrations 
of the charge-programmed multimaterial additive fabrication of 3D piezoelectric 
ceramics with embedded electrodes. UV, ultraviolet. (© and D) Schematics of 
the sintering process of highly loaded piezoelectric ceramics with localized 
electrodes. (E) Schematic of the selective activation (polarization) process of the 
piezoactive phase. (F to 1) Optical images of as-fabricated samples. (J) Scanning 
electron microscopy (SEM) image of the cross section of the sintered PZT. 


(K) Optical image of a printed piezoelectric metamaterial block with embedded 
3D electrodes. (L) SEM image of the boundary between the PZT and Au 
electrode. (M) Elemental mapping of a selectively deposited thin wire, showing 
the minimum deposition width (W). (N) Comparison of piezoelectric charge 
coefficients between piezoceramics presented in this study and current 
3D-printed piezoelectric materials (32-38). (0) Conductivity of Cu and Au 
conductive phases before and after sintering. 


colloid were printed via the multimaterial 3D 
printing system (materials and methods), as 
shown in Fig. 2A. The nanoparticle colloid 
consists of 15 to 50 vol % PZT (APC 855), 
featuring a high piezoelectric strain constant 
(d33 = 600 pm/V) and 5 to 15% lead nitrate, 
which is used as a liquid-phase-sintering 
agent (30). The conductive phase was then 
selectively deposited onto the charged resins, 
forming a 3D microarchitecture with elec- 
trodes (Fig. 2B and materials and methods). 
The deposited architectures were cosintered 
with an optimized temperature profile in a gas 
environment (fig. S5) to prevent degradation 
of the piezoelectricity of the piezoceramics and 
conductivity of the electrodes (Fig. 2C). Ad- 
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ditionally, lead oxide was used to provide a 
liquid seal and lead-rich environment to sup- 
press lead evaporation of PZT at temperatures 
higher than 800°C (Fig. 2D and SM 3). The 
lead nitrate particles within the printed archi- 
tecture melted and induced contraction of the 
PZT particles as a result of their surface ten- 
sion, which densified the PZT and reduced the 
porosity. Next, the architected piezoceramics 
were polarized by applying a strong electric 
field through the deposited metals at an ele- 
vated temperature (SM 3 and fig. S5). After 
polarization, the regions not covered with 
electrodes remained inactive (unpolarized) 
and were employed as the structural phase 
(Fig. 2E). Other ceramics, such as silicon oxy- 


carbide (SiOC) (37), could also be employed as 
the structural phase to enhance the stiffness of 
the metamaterial. 

This 3D fabrication approach allows the 
fabrication of piezoactive materials with pre- 
cise, microscale 3D architectures and low 
porosity (Fig. 2, F to H). Figure 2K shows a 
robotic metamaterial block with embedded 
electrodes. The gapless bonding between the 
piezoactive and conductive phases (Fig. 2L) 
results in minimal dielectric loss when an elec- 
tric field is applied. The minimum achievable 
feature size of the conductive phase reaches 
20 um (Fig. 2M), ensuring precise control of 
the localized electric field when voltages are 
applied through these conductive phases. The 
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as-printed piezoelectric solids reach a measured 
piezoelectric constant (d33) as high as 583 pm/V 
(measured with a d33 meter, APC 90-2030), in- 
dicating that these materials outperform exist- 
ing 3D printable piezoelectric composites or 
ceramics (Fig. 2N) (32-38). The selectively 
patterned conductive phases are highly con- 
ductive (SM 4), achieving values of 3.7 x 10° 
and 1.35 x 10” S/m (where 1S = 1 A/V) after 
sintering for Cu and Au, respectively (Fig. 
20), which makes them suitable for sub- 
sequent actuation with negligible energy loss. 
The surface texture and roughness of differ- 
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ent materials of the as-fabricated samples are 
summarized in SM 6 and fig. S7. See SM 5 and 
fig. S6 for the characterization of mechanical 
properties of the as-printed lattice materials, 
including strength, elastic modulus, and du- 
rability. The high-precision fabrication, highly 
responsive 3D-printed PZT, and highly conduc- 
tive deposited metals enable the implementa- 
tion of metamaterial designs. 


Multi-DoF amplified and programmed strain 


To verify the designed strain modes of the ro- 
botic metamaterials, we measured the electric 
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field-activated deformation of the as-fabricated 
and polarized samples via high-precision 
full-field scanning laser Doppler vibrometry 
(Polytec, PSV-500) (Fig. 3A). A sinusoidal sweep 
signal with 50-V amplitude was applied through 
the layered external electrodes, whereas the 
activated deformation was measured by tracing 
the side surface of the metamaterial (Fig. 3A) 
and reconstructed as operational deflectional 
shapes in the quasi-static region for visualiza- 
tion. Figure 3, B and C, displays an optical image 
of a twist (d33) mode lattice with external 
electrodes and its side-surface deformation, 
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Fig. 3. Experimental verification of robotic metamaterial designs. (A) Schematic —_ with localized electrodes and solid PZT material with external electrodes as a 


of the test setup with a laser vibrometer to measure the deformation of the function of the input voltage. (L) Optical image of the piezoelectric metamaterial 
side surface of the lattice. (B to G) Optical images of the twist, shear, and with amplified twist and suppressed expansion. (M) Twist angle of the metamaterial 
flexure mode lattice and their side-surface deformation. (H) Optical image of a with external and localized electrodes as a function of input voltage. (N) Optical 
modularized piezoelectric actuator with six DoFs. (I) Deformation of the side image of the piezoelectric metamaterial with amplified shear. (0) Shear 
surface of the 6-DoF piezoelectric actuator working in three representative displacement of the metamaterial with external electrodes and localized 
operating modes (d34, d3s, and d33 modes). (J) Optical image of a piezoelectric electrodes as a function of input voltage. (P and Q) Displacement of compound 
metamaterial with amplified expansion. (K) Displacement of the metamaterial and decoupled expansion-shear modes with sinusoidal voltage input. 
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showing a wrapped shape that corresponds 
to a twisted solid (Fig. 1C). Operational deflec- 
tional shapes for shear (d34,) and flexure (d35) 
mode lattices are summarized in Fig. 3, D and 
E, and Fig. 3, F and G, respectively. Different 
strain modes can be combined into a single- 
element piezoelectric lattice to achieve a multi- 
DoF metamaterial actuator with six individually 
actuatable architecture designs with six DoFs, 
including the d33, d34, d35, d33, daa; and dss 
modes (Fig. 3, H and I, and movie 82). This 
multi-DoF actuator has a density of 0.88/cm?, 
which is an order of magnitude lower than 
that of existing piezoactuators with dense PZT 
(23) (fig. S8). We demonstrate a high-speed 
galvanometer actuated by two multi-DoF ac- 
tuators assembled with two reflective mirrors 
(movie 83). Figure S9 shows a star pattern 
drawn by the guided laser beam (10 cycles at 
20 Hz, root mean square precision of 50 + 13 um). 
This is achieved by programming the input 
voltage that controls the tilt angle of the meta- 
material (see SM 7 for details of the laser pat- 
tern reconstruction). 

We fabricated microarchitectures with em- 
bedded electrodes and demonstrated strain 
amplification, strain compounding, and strain 
addition and subtraction enabled by the local- 
ized electrodes and microarchitecture design. 
Figure 3, J, L, and N, present as-fabricated 
lattices with localized electrodes that feature 
amplified expansion (d33; Fig. 1J), amplified 
twist (d33; fig. S3), and amplified shear (d34; 
Fig. 1K), respectively. The displacement versus 
the driving voltage of the lattices was measured 
by the laser vibrometer and plotted (Fig. 3, K, 
M, and O) to derive the corresponding strain 
coefficients. Compared with the strain coef- 
ficients of lattices with layered external elec- 
trodes, the piezoelectric coefficients of the 
localized electrodes (i.e., d33 = 10,050 pm/V) 
are amplified, with values two orders of mag- 
nitude higher than that of their native material 
(583 pm/V). The compound expansion-shear 
strain and the decoupled pure shear strain are 
demonstrated, using designs from Fig. 1K, by 
programming the local polarization and driv- 
ing electric field within the embedded electrode 
architectures (Fig. 3, P and Q). 


Robotic metamaterials as proprioceptive 
microrobots 


These piezoceramic architectures have bi- 
directional electrical-to-mechanical energy 
conversion—namely, direct and inverse piezo- 
electric effects. These metamaterials can be 
exploited as a proprioceptive microrobotic sys- 
tem with feedback control and without the 
need to combine transmissions and external 
sensors. Within a single piece of piezoelectric 
architecture, selected regions serve as actua- 
tion limbs using the inverse piezoelectric effect, 
whereas other regions that use the direct and 
bidirectional piezoelectric effects serve as strain 
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sensors and pulse-echo elements, respectively 
(Fig. 4A). 

We fabricated one piece of metamaterial 
with an amplified d33 mode (Fig. 1J) as a mul- 
timodal microrobot (Fig. 4B; see SM 8 and figs. 
$10 and S11 for an alternative version consist- 
ing of compound flexure-twist mode lattices). 
Selected regions serve as robotic limbs capa- 
ble of actuation and load carrying, with the 
ability to sense both contact forces and remote 
objects. The microrobot is controlled by a 
closed-loop system via its sensing components 
(fig. $12). The microrobot consists of three in- 
dividually actuatable regions that serve as its 
body, rear legs, and front legs (Fig. 4B). The 
coordinated motion of these three regions 
enables three distinguishable modes described 
as walking, turning, and jumping (Fig. 4B; see 
materials and methods and fig. S14, A and B, 
for motion planning based on the three limbs). 
The 3D localized electrode interconnects all 
piezoactive elements within one region and 
minimizes the number of connection leads, 
enabling a compact design (without additional 
circuit boards or leads wired to individual 
piezoactive elements) and making the micro- 
robot lightweight (740 mg). 

In contrast to existing piezoelectric actua- 
tion mechanisms in which leveraging or trans- 
mission methods are employed to amplify 
displacement at the cost of reducing blocking 
force, a feature of the actuation performance of 
the robotic metamaterial is its simultaneously 
increased blocking force and displacement 
via its skeletal actuator layout. An actuator 
connected with a transmission system for 
robotic motions typically leads to a trade-off 
between blocking force and step length, owing 
to the compliant levering mechanism involved 
(39, 40). We show that these architected piezo- 
electric robotic limbs (Fig. 4A), composed of a 
network of microscale actuation units distrib- 
uted throughout the robot body in a fashion 
similar to skeletal muscles, breaks this trade- 
off and simultaneously produce long steps 
and high blocking force with high system stiff- 
ness. A comparison between the skeletal actua- 
tion architecture and actuator-transmission 
mechanism is described in fig. S13 and SM 9. 
Although a transmission system effectively 
amplifies the displacement, its additional 
levering inevitably reduces the system stiffness 
and therefore the blocking force (Fig. 4C; N 
denotes the number of unit cells of the meta- 
material block), resulting in a decreased pay- 
load capability of the robot when driven at 
higher speeds. The interconnected microactua- 
tor and embedded electrode microarchitecture 
allow the driving voltage to be applied locally at 
each element, enabling a higher driving electric 
field and thus an amplified displacement. This 
is achieved while maintaining the overall stiff- 
ness through the interconnections of the unit 
cells without compliant transmissions. These 


features enhance the performance of the mi- 
crorobot, including high speed, payload, and 
resonance. 

The long steps and high resonance of the 
metamaterial limb enable a maximum speed 
of 128 mm/s in the walking mode and a turn- 
ing rate of 90°/s (Fig. 4D and movie S4; see 
fig. S14 for the frequency response of the 
limb and speed benchmark). With its broad- 
band response (fig. S14), the robotic meta- 
material achieves a short rise time to impulse 
voltage inputs (47). This enables the release 
of actuation energy within 1 ms, resulting in 
the microrobot lifting its front legs into the 
air and then lifting its body and rear legs, thus 
allowing it to step up from the ground (movie 
S4). The jumping mode enables the micro- 
robot to climb over elevated and rough surfaces 
(Fig. 4E and movie S4), movement that is not 
achievable by piezoelectric microrobots with 
small strokes (42, 43). The stiffness of the 
piezoelectric metamaterial elements enables 
a large system payload. Various weights (1 to 
5.6 g) are loaded on the microrobot, and its 
speed is measured by digital image correla- 
tion. More than 80% of the original speed is 
retained for a 500% payload, compared with a 
>80% speed reduction with a <200% payload 
for microrobotic systems using a compliant 
mechanism (Fig. 4F) (39, 44). 

By assigning sections of the piezoactive 
phases as the sensing element and using its 
direct piezoelectric effects via 3D embedded 
electrodes, we demonstrate that the robotic 
metamaterial is capable of self-sensing its strain 
change and responding to external stimuli, 
which allows closed-loop control and quick 
reactions to external stimuli. As shown in 
Fig. 4B, the selectively deposited electrodes 
separate a strain-sensing section from a piezo- 
active element with a 100-um isolation gap, 
which eliminates electrical coupling. This sec- 
tion is integrated with a microcontroller to 
form a closed-loop control system that receives 
the sensing voltage and sends the control volt- 
age to the actuation regions. The self-sensing 
voltage signal updates as the actuator element 
deforms, allowing real-time monitoring of 
the gait status and response to stimuli (see 
fig. S12 for control loop design; see SM 10 
and fig. S15 for more details on proprioception 
and exteroception). Figure 4G demonstrates 
the gait-monitoring signal, detection of the 
external impact, and programmed reaction 
within 0.1 ms upon a drop-weight impact 
during locomotion (SM 11). Movie S5 shows 
other programmed reactions that include stop- 
ping or turning upon detecting an obstacle. 
The high payload of the metamaterial-enabled 
robot allows for the integration of an onboard 
power (7.4 V), a driver, and a microcontroller, 
paving the way for fully untethered operations 
with closed-loop control (SM 12, fig. S16, and 
movies S6 and S7). 
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The robotic metamaterial is also capable of 
sensing and reacting to noncontact stimuli 
when the pulse-echo element functions with 
bidirectional piezoelectric effects. As shown in 
Fig. 4B, an element on a frontal leg is capable 
of emitting and receiving focused ultrasonic 
waves with a semispherical acoustic lens printed 
with the structural phase. The echo signal (fig. 
S17 and materials and methods) from the re- 
mote object is processed by the closed-loop 
control system that updates the driving voltage 
to navigate through an S-shaped path and 
avoid remote obstacles (Fig. 4, H and I, and 
movie S8). 


Discussion and outlook 


We introduced a strategy to design and con- 
struct a class of robotic metamaterials that 
incorporate electronic, structural, and con- 
ducting microscale strut elements in a 3D 
architecture. These multifunctional meta- 
materials revealed a myriad of strain modes, 
including twisting, flexure, compound, de- 
coupled, and amplified strains. The micro- 
architecture designs surpass the limitations of 
natural piezoelectric crystals in which the piezo- 
electric strain relies on the available natural 
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crystalline structures. The design strategy can 
be combined with a topology-optimization algo- 
rithm to generate arbitrary piezoelectric tensors. 

The multimaterial additive manufacturing 
technique reported herein combines piezocer- 
amic, metallic, and structural materials into a 
complex 3D architecture. The resulting meta- 
material blocks with milli- to centimeter di- 
mensions are capable of outputting multi-DoF 
motions with high blocking force as well as 
sensing contact and remote stimuli. This is 
accomplished without any external sensors 
and transmissions. Compared with other ac- 
tuation materials (e.g., dielectric elastomers), 
the robotic metamaterials have low driving volt- 
age, broad frequency range, two-way sensing, 
and actuation. The reported design framework 
and manufacturing methods have direct im- 
plications for the future development of 
miniaturized robots, transducers, and robotic 
materials; it will be possible to achieve desired 
motions and decision-making via simplified 
artificial materials. 
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A genetic module at one locus in rice protects 
chloroplasts to enhance thermotolerance 
Hai Zhang*?, Ji-Fu Zhou’, Yi Kan’, Jun-Xiang Shan'?, Wang-Wei Ye", Nai-Qian Dong*’, 


Tao Guo’, You-Huang Xiang", Yi-Bing Yang>*, Ya-Chao Li?*4, Huai-Yu Zhao’, Hong-Xiao Yu*’, 
Zi-Qi Lu'*, Shuang-Qin Guo’, Jie-Jie Lei'?*, Ben Liao“, Xiao-Rui Mu**, Ying-Jie Cao’, 


Jia-Jun Yu'*, Youshun Lin®*, Hong-Xuan Lin 


1,2,3,4% 


How the plasma membrane senses external heat-stress signals to communicate with chloroplasts 

to orchestrate thermotolerance remains elusive. We identified a quantitative trait locus, Thermo-tolerance 3 
(TT3), consisting of two genes, T7T3.1 and TT3.2, that interact together to enhance rice thermotolerance 
and reduce grain-yield losses caused by heat stress. Upon heat stress, plasma membrane-localized E3 ligase 
TT3.1 translocates to the endosomes, on which TT3.1 ubiquitinates chloroplast precursor protein TT3.2 for 
vacuolar degradation, implying that TT3.1 might serve as a potential thermosensor. Lesser accumulated, 
mature TT3.2 proteins in chloroplasts are essential for protecting thylakoids from heat stress. Our findings 
not only reveal a TT3.1-TT3.2 genetic module at one locus that transduces heat signals from plasma 
membrane to chloroplasts but also provide the strategy for breeding highly thermotolerant crops. 


lants have developed multiple resistance 

mechanisms (/-3), including the ubiq- 

uitin proteasome system (4-7), and chlo- 

roplast stress responses (8-17) to buffer 

themselves against temperature fluctua- 
tions. The process of ubiquitination plays a key 
role in eliminating denatured proteins and 
thus improves plants thermotolerance (6, 12). 
Chloroplasts are photosynthetic organelles 
that also contribute to heat-stress responses 
(8, 10, 13, 14). Chloroplast proteins damaged 
by stress need to be rapidly degraded through 
chloroplastic hydrolases or extraplastidic path- 
ways (26S-proteasome and autophagy), which 
are well described (15-17). However, how heat 
signals are transduced from the plasma mem- 
brane into chloroplasts to control plant thermo- 
tolerance remains unknown. We found that 
a Thermo-tolerance 3.1 (TT3.1)-TT3.2 genetic 
module in rice links the plasma membrane to 
chloroplasts to protect chloroplasts from heat- 
stress damage and increases grain yield under 
heat stress. 


Results 
Identification of TT3 locus for thermotolerance 


To identify potential quantitative trait loci 
(QTLs) conferring rice thermotolerance, we 
tested a set of chromosome segment substitu- 
tion lines with the African rice variety CG14 as 
the donor parent and the Asian rice variety 
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Wuyunjing (WYJ) as the recurrent parent (fig. 
S1A) (18). We selected a QTL, designated 773, 
for further fine-mapping and delimited TT3 
to a 36.6-kb region (fig. S1, B to D). We then 
constructed the nearly isogenic line (NIL) that 
contains this region and carries TT3 by back- 
crossing with WYJ (NIL-T73@), NIL-TT3¢@* 
was more tolerant than its isogenic control, 
NIL-77T3"”, upon heat stress (Fig. 1, A and B), 
indicating that 7T3 is a QTL contributing to 
rice thermotolerance. 

In the 773 region, three genes were anno- 
tated: LOC_Os03 g49900, LOC_Os03 g49930, 
and LOC_Os03 g49940, whereas three other 
genes encode retrotransposon or expressed 
proteins (Fig. 1C). Knocking out LOC_Os03 
£49930 exhibited the same hypersensitive 
phenotypes as WYJ, implying that LOC_Os03 
£49930 had no effects on thermotolerance 
(fig. S2, A and B). Nevertheless, the LOC_Os03 
g49900 knockout lines exhibited more sensi- 
tivity to heat than WYJ, whereas the LOC_Os03 
49940 knockout seedlings were more thermo- 
tolerant than WYJ (Fig. 1, D and E, and fig. S2, C 
to F), indicating that LOC_Os03 g49900 and 
LOC_Os03 g49940 were both thermotolerance- 
related candidate genes at 773. The LOC_Os03 
£49900 gene was defined as TT3.1 and LOC_ 
Os03 g49940 as TT3.2 for further investiga- 
tion. Sequence comparisons of NIL-773”*” 
and NIL-773©* within 773.1 and TT3.2 un- 
covered one amino acid substitution in 773.1 
(Ala”5_,Val) and TT3.2 (Ser*“?—Gly), respec- 
tively (Fig. 1C). Further protein sequence align- 
ments of TT3.1, TT3.2, and their homologs 
revealed that TT3.1 and TT3.2 are conserved 
in many monocots (fig. S3). We also found that 
heat treatment induced the expression of 
TT3.1 and TT3.2 and that their expression 
levels between NIL-773”"” and NIL-TT3©@* 
were not significantly different (fig. S4, A and 
B), which was consistent with the absence of 
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heat-responsive elements among polymorphic 
regulatory elements in the 773.7 and TT3.2 
promotors (fig. $4, C and D). 

We then overexpressed the coding region of 
TT3.1"™" and TT3.1°* in WYJ, respectively, 
and found that the overexpression lines were 
both more thermotolerant than WYJ (Fig. 1, F 
and G, and fig. S2, Gand H). We also found 
that with the increased expression levels of 
TT3.1"" and TT3.1" in overexpression lines, 
seedlings exhibited enhanced thermotolerance 
(fig. S2, I to K). However, at similar levels of 
gene expression, overexpression-773.1°°* was 
more tolerant than overexpression-773.1 Wey. 
under heat stress, which indicates that 773.1“ 
possessed the stronger effect on heat tolerance 
and is an elite allele for improving thermotol- 
erance. Next, we overexpressed the coding 
sequence (CDS) of T1320 in WYJ and found 
that the transgenic plants displayed weaker 
heat tolerance than WYJ (fig. S2L). Together, 
these results suggested that 773.1 and T73.2 are 
both candidate genes for TT3 locus and that 
TT3.1 is a positive regulator to promote thermo- 
tolerance, whereas 773.2 plays a negative role. 

We generated the ##3.1t#3.2 double mutant by 
crossing t¢3.1 with tt3.2. The tt3.1 mutant was 
sensitive to heat stress; the ##3.2 mutant was 
tolerant to heat stress, and the #3.1tt3.2 double 
mutant resembled the ##3.2 mutant in terms of 
thermotolerance (fig. S2, M and N), suggesting 
that 773.1 acts upstream of 773.2 to regulate 
heat tolerance. 


TT3 enhances thermotolerance at 
reproductive stages 


To test whether the heat tolerance response 
of TT3 comes at the cost of growth and yields, 
we performed yield-trait evaluation. Field ex- 
periments showed that NIL-773”"”” and NIL- 
TT3" had no significant difference in yield 
traits (fig. S5, A and B). Overexpression-773.10"* 
and ¢t3.2 mutant also exhibited no significant 
differences in yield traits compared with those 
of WYJ (fig. S5, C to E), which indicates that the 
TT3 locus and the genetic modification of TT3 
candidate genes increased thermotolerance 
without obvious growth and yield penalties. 

Next, we evaluated the thermotolerance of 
NIL-7T3 at the reproductive stage in the arti- 
ficial greenhouse with high-temperature treat- 
ments. At the heading stage, all leaves and 
tillers in NIL-773©* plants remained green 
and alive, whereas those of the NIL-773”” 
plants were dried and almost dead (Fig. 2A). 
The NIL-773°°” plants showed a marked 
increase in seed setting rate and 1000-grain 
weight and an approximately onefold increase 
in grain yield per plant compared with that of 
NIL-T73"”” (Fig. 2, B to D). At the grain-filling 
stage, no significant differences were observed 
in leaves and tiller status between NIL-T73"” 
and NIL-T73°°" (fig. S5F). However, NIL- 
TT3©“* plants had significantly increased in 
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Fig. 1. Identification of TT3 locus for rice thermotolerance. (A and 
B) Photos (A) and survival rate (B) of 12-day-old NIL-T73 rice plants under 
normal (control) or recovery conditions for 7 days after heat stress (42°C, 


14 hours). Scale bars, 5 cm. Data are mean + SEM (n = 8 biological replicates; 


each replicate is 24 plants). Significant differences were determined by 
a two-tailed t test. ***P < 0.001. (C) TT3 was localized to a 36.6-kb region 
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(D), tt3.2 mutants (E), overexpression-T73.1"% (OE-TT3.1") (F), and 
overexpression-173.1°°% (OE-TT3.1°°") (G) transgenic rice plants under 
normal (control) or recovery conditions for 7 days after heat stress (42°C, 
14 hours). Scale bars, 5 cm. 


seed setting rate and 1000-grain weight and 
had an approximately onefold increase in 
grain yield per plant compared with that of 
NIL-TT3"” (fig. S5, G to I). Field experiments 
also confirmed that the TT3 locus improved 
heat tolerance at the reproductive stage. In 
field conditions (fig. S5J), no significant dif- 
ferences were observed in phenotypes and 
average plot yield between NIL-7T. 37 and 
NIL-773 (fig. S5, K and L). However, under 
high-temperature treatment at the field, with 
sunlight-heated greenhouses covered with 
plastic film (fig. $5, M and N), NIL-773°C* 
showed a 20% increase in average plot yield 
compared with that of NIL-773"”” (fig. S50). 
These results further support the notion that 
the 773“ allele has potential for improving 
heat tolerance. 

We also determined the thermotolerance 
of TT3 transgenic and mutant lines at the 
heading stage in the artificial greenhouse and 
showed that heat treatment had no impact on 
phenotypes of overexpression-773.1°° and 
tt3.2, whereas in WYJ, leaves were barely green 
and almost dead (Fig. 2E). Overexpression- 
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TT3.1°"* and tt3.2 exhibited a significant in- 
crease in seed setting rate and 1000-grain weight 
and 2.6- to 3.8-fold increases in grain yield per 
plant compared with that of WYJ (Fig. 2, F 
to H). Under normal conditions, pollen fertility 
was similar in WYJ, overexpression-773.1°@™, 
and tt3.2 (fig. S5, P and Q), whereas after heat 
treatment, pollen fertility of overexpression- 
T7T3.1°** and tt3.2 was significantly higher 
than that of WYJ (fig. S5, P and R). Collectively, 
these results indicated that overexpressing 
TT3.1 or knocking out 773.2 conferred signif- 
icant yield increases under heat stress, thus 
providing the strategies for breeding highly 
heat-tolerant crops. 


Heat stress triggers TT3.1 and TT3.2 
translocation to endosomes 


TT3.1 was predicted to encode a RING-type E3 
ligase with transmembrane domains (fig. 
S6A). Subcellular localization assays found 
that TT3.1-green fluorescent protein (GFP) 
showed plasma membrane localization with 
cytosolic puncta (Fig. 3A and fig. S6B). To in- 
vestigate the nature of those puncta, we per- 
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formed FM4-64 staining in p7T3.1::TT3.1-GFP 
transgenic rice roots, showing that TT3.1-GFP 
localized to endosomes (fig. S6C), which was 
confirmed by colocalization with the early 
[SCAMP1-red fluorescent protein (RFP)] 
and late endosome markers (ARA7-RFP) in 
Nicotiana benthamiana (fig. S6, D and E). 
These results suggested that TT3.1 localized 
to the plasma membrane and endosomes. 

TT3.2 was an unknown transmembrane pro- 
tein that contained a chloroplast transit peptide 
(fig. S6F). Transient expression assays showed 
that TT3.2 mainly localized to the chloroplast 
(Fig. 3B and fig. S6G). We also found a few 
TT3.2 puncta colocalized with FM4-64-labeled 
endosomes in 773.2-GFP transgenic rice roots 
(fig. S6H). Therefore, we speculated that TT3.1 
might recruit newly synthesized TT3.2 chloro- 
plast precursors in the cytosol to endosomes. 
This hypothesis was verified by the colocal- 
ization of TT3.1-GFP and TT3.2-mCherry on 
intracellular puncta, likely to be endosomes 
(Fig. 3C, left). 

We tracked TT3.1 dynamics and found that 
under normal conditions, TT3.1 largely showed 
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Fig. 2. TT3 enhances rice thermotolerance at reproductive stages. (A) Phenotypes of mature plants 

and total grains per plant in NIL-773 after 30 days of high-temperature treatment (38° and 34°C, day and night) at 
the heading stage. Scale bars, 5 cm. (B to D) Comparison of seed setting rate (B), 1000-grain weight (C), 

and grain yield per plant (D) between NIL-773" and NIL-TT3©“ after 30 days of high-temperature treatment 
(38° and 34°C, day and night) at heading stage. Data are mean + SEM (n = 10 plants). Significant differences 
were determined by a two-tailed t test. **P < 0.01, ***P < 0.001, ****P < 0.0001. (E) Phenotypes of mature plants 
and total grains per plant in WYJ, overexpression-T73.1°C“, and tt3.2 mutant plants after 30 days of high- 
temperature treatment (38° and 34°C, day and night) at the heading stage. Scale bars, 5 cm. (F to H) Comparison 
of seed setting rate (F), 1000-grain weight (G), and grain yield per plant (H) between WYJ and transgenic rice 
plants after 30 days of high-temperature treatment (38° and 34°C, day and night) at heading stage. Data are mean 
+ SEM (n = 8 plants). Significant differences were determined by one-way analysis of variance with post hoc 
Dunnett's multiple comparisons test, with different letters at P < 0.01. 


plasma membrane localization (Fig. 3D, left), 
whereas under heat stress, TT3.1 translocated 
from the cell surface into intracellular com- 
partments and eventually lost its plasma mem- 
brane localization patterns in N. benthamiana 
(Fig. 3D, middle and right) and in rice (fig. S7). 
We further investigated the dynamic relation- 
ship between TT3.1 and TT3.2 and found that 
heat treatment promoted the cotranslocation 
of TT3.1 and TT3.2 to endosomes (Fig. 3C, 
middle and right). Together, heat stress trig- 
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gered TT3.1 internalization from the plasma 
membrane to endosomes to associate with 
TT3.2. 


TT3.1 relocalizes TT3.2 to endosomes under 
heat stress 


We characterized the spatial relationship be- 
tween TT3.1 and TT3.2 on endosomes at nano- 
meter resolution. Under normal conditions, 
TT3.2”™ was observed on the thylakoid mem- 
branes and exhibited a similar number of gold 
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particles when compared to TT3.2°"* (fig. S8, 
A to C). After heat treatment, TT3.2 accumu- 
lated on thylakoid membranes of NIL-TT3”” 
(Fig. 4, A and C), whereas few TT3.2 gold par- 
ticles were detected on thylakoid membranes 
of NIL-7T3° (Fig. 4, B and C). These results 
implied that TT3.2 localized to the thylakoid 
membranes and that TT3.2 protein abundance 
decreased in chloroplasts in NIL-773°@" but 
not in NIL-773”” upon heat stress. 

By contrast, the results of immunogold la- 
beling showed that upon heat stress, TT3.2 
was more accumulated on multivesicular en- 
dosomes in NIL-773© than in NIL-TT3”” 
(Fig. 4, D to F). These results suggested that 
TT3.2 precursor proteins destined to chloro- 
plasts can be selected into multivesicular endo- 
somes by TT3.1 for the vacuolar degradation, 
and the sorting efficiency was higher in NIL- 
773 than in NIL-T73"™”. Thus, the mature 
TT3.2 proteins of NIL-773°¢™ accumulated 
less on thylakoid membranes than those of 
NIL- 773” under heat stress. 

Plasma membrane- and endosome-localized 
TT3.1-GFP proteins were also confirmed by 
transmission electron microscopy imaging (fig. 
S8, D to I). Furthermore, double immunogold- 
labeling assays showed that few gold particles 
of TT3.1 and TT3.2 were observed on multi- 
vesicular endosomes under normal conditions 
(Fig. 4, G and I). However, the numbers of 
TT3.1 and TT3.2 gold particles were increased 
on multivesicular endosomes upon heat stress 
(Fig. 4, H and I), which supports the hypoth- 
esis that TT3.1 relocalizes TT3.2 chloroplast 
precursor protein to multivesicular endosomes 
under heat stress. 


TT3.1 ubiquitinates TT3.2 on endosomes 
for degradation 


We performed in vitro ubiquitination assays to 
detect the ubiquitin ligase activity of truncated 
TT3.1 (TT3.1C) (fig. S9A). Polyubiquitination 
conjugation was observed in TT3.1C-maltose 
binding protein (MBP)- purified proteins, 
whereas site mutation of the TT3.1C RING 
domain (MTT3.1C) abolished the ligase activ- 
ity (fig. S9, A and B). This result indicated 
that TT3.1 was a functional E3 ligase, and the 
ligase activity was dependent on the 441 and 
443 amino acids sites of the RING domain. 

To test the functional relationship between 
TT3.1 and TT3.2, we first verified their inter- 
actions by yeast two-hybrid (Y2H) assays. 
TT3.1"”7 and TT3.2~~”” exhibited a strong 
interaction; unexpectedly, the interaction of 
T13.1°C" with TT3.2°C™ and that of TT3.1°°" 
with TT3.2””’ were weaker than that of 
TT3.1"” with TT3.2”” (Fig. 5A). However, 
site mutations of TT3.1°°* in the ubiquitin 
functional RING domain (MTT3.1°“"") (fig. 
S9A) restored the strong interaction between 
MTT3.1°"* and TT3.2 (Fig. 5A). Moreover, the 
split firefly luciferase complementation assay 
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Fig. 3. TT3.1 senses high temperature and internalizes to colocalize with 
TT3.1-GFP in N. benthamiana 

leaf cells. TT3.1-GFP colocalized with the plasma membrane-localized guanosine 
triphosphatase OsRacl-mCherry. White arrows indicate the punctate signals of 
TT3.1-GFP. The rectangle indicates the enlarged part. Scale bars, 20 um. 

(B) Localization of TT3.2-GFP to the chloroplast. The rectangle indicates the 
enlarged part. Scale bars, 10 um. (C) Colocalization of TT3.1-GFP and TT3.2- 
mCherry in N. benthamiana leaf cells after high-temperature treatment (42°C for 
0, 2, and 3.5 hours) in the presence of cycloheximide (CHX, 50 uM, 2 hours 


TT3.2. (A) Plasma membrane localization o 


confirmed that TT3.1 interacted with TT3.2 (fig. 
S9C). We next performed an in vitro ubiquiti- 
nation assay, which showed that TT3.1C-MBP 
could polyubiquitinate TT3.2””7-Flag and 
TT3.2°*"-Flag, respectively (fig. S9D). These 
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results demonstrated that TT3.1 targets TT3.2 
for polyubiquitination. 

We next examined whether TT3.1 deter- 
mined the degradation of TT3.2 in vivo and 
found that under heat stress, TT3.1°°" in 


17 June 2022 


RESEARCH | RESEARCH ARTICLE 
A GFP OsRac1-mCherry Merged B Chlorophyll Merged 
y 
o & oO 
LL LL 
9 a . 
: N) : 
= N 
oO oO 
- E 
c i= 
oO ®o 
= = 
®o ® 
> > 
& & 
Cc Cc 
Wi Ww 
Cc CHX + 42°C D CHX + 42°C 
Oh 2h 3.5h Oh 2h 3.5h 
a o 
re i 
° sy 
6 os 
E E 
b bE — ee be 
2 
oO ” 
5 d 
: 5 
~ > 
8 & 
N (= 
ma uw 
‘a 
ke 
2 i 
2 6 
Ss E 
[e) _— 
= oO 
S & 
2) 
O 
xo) ze) 
®o ®o 
> > 
®o ® ®o 
" fo at : 


before heat treatment). White arrows indicate the punctate signals of TT3.1-GFP 
in the cytoplasm: yellow arrows indicate the punctate signals of TT3.2-mCherry; 
blue arrows indicate the colocalization of TT3.1-GFP and TT3.2-mCherry. 

Scale bars, 20 um. (D) Localization of TT3.1-GFP after high-temperature treatment 
(42°C for 0, 2, and 3.5 hours) in the presence of CHX (50 uM, 2 hours before 
heat treatment). OsRacl-mCherry is a plasma membrane marker. White arrows 
indicate the punctate signals of TT3.1-GFP in the cytoplasm. The rectangle 
indicates the enlarged part. Scale bars, 20 um. Experiments were repeated 
three times with similar results. 


NIL-773°@* promoted TT3.2 degradation, 
whereas the degradation efficiency of TT3.2 
declined in the 773.1"”” background (Fig. 5B). 
This result indicated that TT3.1°““* had a stron- 
ger function for TT3.2 degradation than TI3.1"™. 
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pUBI::TT3. 1°914-GFP (anti-GFP/TT3.2) 


Fig. 4. Heat stress triggers TT3.1 and TT3.2 sorting into multivesicular 
endosomes. (A to C) Immunogold electron microscopy with the anti-TT3.2 
antibody (raised in rabbit) showing gold particles of TT3.2 localized in thylakoid 
membranes in chloroplasts of NIL-TT3” (A) and NIL-TT3°C% (B) leaf cells, 
and quantitative analysis (C) under heat treatment (42°C for 8 hours). The images 
with labels Al and Bl (right) are enlargements of the frames in white (left). 

The black arrows indicate TT3.2 immunogold particles at the thylakoid membrane 
(TM). Scale bars, 400 nm in (A) and (B) and 200 nm in (Al) and (B1). Data 

are mean + SEM (n = 20 regions: each region is 1 um*). Significant differences were 
determined by a two-tailed t test. ****P < 0.0001. (D to F) TT3.2 immunogold 
particles localized to endosomes of NIL-T73"” (D) and NIL-TT3°* (E) leaf cells, 
and quantitative analysis (F) under heat treatment (42°C for 8 hours). Images 
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labeled D1 and E1 (right) are enlargements of the frames in white (left). The black 
arrows indicate TT3.2 immunogold particles in endosomes. Scale bars, 200 nm. 
Data are mean + SEM (n = 12 endosomes). Significant differences were determined 
by a two-tailed t test. ****P < 0.0001. (G to I) Immunogold electron microscopy 
with anti-TT3.2 antibody (raised in rabbit) against TT3.2 (5 nm gold) and 
anti-GFP antibody (raised in mice) against GFP (10 nm gold), showing gold 
particles of TT3.2 and TT3.1-GFP localized in endosomes in pUBI::TT3.1°°4-GFP 
transgenic rice plants under control (G) or heat treatment (42°C for 8 hours) 
(H), and quantitative analysis (I). The black and red arrows indicate TT3.2 and 
TT3.1-GFP immunogold particles in endosomes, respectively. Scale bars, 200 nm. 
Data are mean + SEM (n = 20 endosomes). Significant differences were 
determined by a two-tailed t test. ****P < 0.0001. 


To clarify the degradation pathway of TT3.2, 
we performed immunoblot assays with drug 
treatments. In NIL-773““, the degradation 
rate of TT3.2 remained similar under treat- 
ments with cycloheximide and cycloheximide + 
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MG132 (proteasome inhibitor) (Fig. 5C), which 
suggests that TT3.2 degradation is not medi- 
ated by the 26S proteasome pathway. However, 
treatment with cycloheximide + E64d (vacuolar 
hydrolases inhibitor) suppressed both TT3.2 
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degradation (Fig. 5C) and the thermotolerance 
of NIL-7T7T3°** (fig. S9E), indicating that 
the endosome-to-vacuole pathway participated 
in TT3.2 degradation. Although the tendency of 
TT3.2 degradation is similar in the 773.17” 
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Fig. 5. TT3.1 interacts with TT3.2 for degradation, and overexpression-TT3.1 
and tt3.2 protect thylakoids. (A) Interaction of TT3.1 and TT3.2 in Y2H assay. TT3.1 
and MTT3.1 (point mutations of TT3.1) CDSs were cloned into the pBT3-STE vector. 
TT3.2 CDSs were cloned into the pPR3-N vector. The combination of TT3.1-STE and 
pOstl-Nubl was used as a positive control. The combination of TT3.1-STE and 
pPR3-N was used as a negative control. Medium contained 15 mM 3-aminotriazole. 
(B) Degradation efficiency of TT3.2 in NIL-TT3”” and NIL-TT3°¢” plants upon heat 
stress (0, 4, 8, and 12 hours). Equal protein loading was confirmed with the anti- 
CF1p antibody. (C) Heat promoted the degradation of TT3.2 via the endosomes-to- 
vacuole degradation pathway. Twelve-day-old NIL-TT3° rice seedlings were 
collected after 42°C heat treatment in the presence of CHX (protein synthesis inhibitor 


and 773.1“ backgrounds, the total degrada- 
tion efficiency is much lower in 773.1” than 
in 773.1°°" (fig. SOF). In short, TT3.1°C* was 
more efficient than TT3.1”™ in promoting 
TT3.2 degradation via the endosome-to-vacuole 
pathway. 


TT3.2 degradation protects thylakoids against 
heat-stress damage 


To investigate the effect of TT3.2 on chloro- 
plasts, we carried out transmission electron 
microscopy analysis. Under normal conditions, 
the chloroplasts in NIL-773, transgenic, and mu- 
tant plants developed normally (fig. S9G). After 
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heat treatment, chloroplasts in NIL-773"”” 


and ##3.1 mutant were disrupted with distorted 
and impaired stroma lamellae (Fig. 5D). How- 
ever, in NIL-773C, overexpression-773.1""”, 
overexpression-773.1°°, and tt3.2 mutant 
plants, the thylakoids retained their initial 
configurations (Fig. 5D). We also found that a 
similar abundance of photosynthetic pro- 
tein complexes was observed in NIL-TT3 (Fig. 
5E), WYJ (wild type), transgenic, and mu- 
tant plants under normal conditions (Fig. 5F), 
whereas under heat stress, the abundance of 
photosynthetic protein complexes was de- 
creased in NIL-773"”, WYJ, and tt3.1 but 
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in eukaryotes, 50 uM), CHX with MG132 (proteasome inhibitor, 50 uM), and CHX with 
E64d (vacuolar hydrolases inhibitor, 50 uM), respectively. TT3.2 protein was confirmed 
with anti-TT3.2 antibody, and equal protein loading was confirmed with anti-CF1p 
antibody. (D) Transmission electron micrographs of chloroplast ultrastructure from 
leaves of NIL-TT3, tt3.1 mutant, overexpression-773.1"”, overexpression-T73.1°°, 
and tt3.2 mutant plants after 42°C heat treatment for 14 hours. Red arrows indicate 
collapsed thylakoids. Scale bars, 400 nm. (E and F) Blue native-polyacrylamide gel 
electrophoresis analysis of thylakoid membrane proteins from detached leaves of 
NIL-TT3 (E), WYJ, transgenic, and mutant plants (F) under normal growth and heat- 
stress conditions (42°C, 14 hours). PSI, photosystem |; NDH, [NAD(P)H dehydrogenase]; 
LHC, light-harvesting complex; mc, megacomplex; sc, supercomplex. 


remained stable in NIL-TT3, overexpression- 
TT3.1"”, overexpression-773.1°C, and tt3.2 
(Fig. 5, E and F). 

We further detected the abundance of core 
subunits of photosystem II (PSII) (e.g., D1 
and D2), showing that D1 and D2 abundance 
remained similar in NIL-773"” and NIL-T73°™* 
under normal conditions (fig. S9H). Upon heat 
stress, D1 and D2 abundance was decreased 
in NIL-773™™” but stabilized in NIL-T73 
(fig. SOH). Furthermore, the abundance of D1 
and D2 was stable in overexpression-773.1"”, F 
overexpression-773.1°"", and tt3.2 but de- 
creased in WYJ and ?#t3.1 under heat stress 
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Fig. 6. Proposed model for TT3.1 and TT3.2 function in thermotolerance. 
Under normal conditions, TT3.1”7 and TT3.1°*“ mainly localize to the plasma 
membrane; TT3.2" and TT3.2°%" precursor proteins are imported from 

the cytosol to chloroplasts and do not influence plant growth. Upon heat stress, 
TT3.1°C with high ligase activity is sorted into endosomes, where it interacts with 
TT3.2°*¥ precursor proteins for ubiquitination, and the latter is delivered to 


(fig. S91). Together, these results suggested 
that the accumulation of chloroplast-localized 
TT3.2 is detrimental to the PSII complex and 
thylakoids, and TT3.1 favors maintenance of 
chloroplast stability by mediating the degra- 
dation of the chloroplast precursor protein 
TT3.2 during heat stress. 


Discussion 


Thermosensing is the first step in heat-stress 
responses (19-21), and the identification of 
thermosensors is challenging (22). In this 
study, we showed that heat-triggered trans- 
location of TT3.1 from the plasma membrane 
to endosomes was likely to transduce environ- 
mental signals from the cell surface into intra- 
cellular organelles. The endosome-localized 
TT3.1 protein ubiquitinated the TT3.2 chloro- 
plast precursor protein, which was then sorted 
into endosomes for vacuolar degradation. In 
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the African rice CG14 variety, TT3.1 has high 
E3 ubiquitin ligase activity to rapidly degrade 
the TT3.2 chloroplast precursor protein, which 
can reduce TT3.2-triggered chloroplast damage 
caused by heat stress, exhibiting high thermo- 
tolerance under heat stress (Fig. 6, right). How- 
ever, in the Asian rice WYJ variety, TT3.1 has 
low E3 ubiquitin ligase activity, resulting in 
the accumulation of TT3.2 and consequent 
destruction of chloroplasts under heat stress, 
which manifests as a heat-sensitive phenotype 
(Fig. 6, left). In all, we proposed that TT3.1 
might be a potential thermosensor involved in 
heat-stress responses (22). TT3.2 is a chloro- 
plast membrane protein with unknown func- 
tions. How TT3.2 accumulation promotes 
chloroplast damage induced by heat stress is 
an intriguing question for future research. In 
addition, we uncovered a mechanism for chlo- 
roplast precursor protein degradation medi- 
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endosomes for vacuolar degradation. Lesser imported TT3.2°*"* in chloroplasts 
avoid the disruption of the PSII complex and thylakoids, thus enhancing 
thermotolerance and increasing rice productivity (right). By contrast, low ligase 
activity of TT3.1" results in reduced degradation of TT3.2””. Overaccumulation 
of TT3.2" in chloroplasts disrupts the PSII complex and thylakoid stability, 
leading to thermosensitivity with growth inhibition and yield loss (left). 


ated by the endosome-to-vacuole pathway 
upon heat stress. 

TI3"Y (173.1 /TT3.2"*”) from temperate 
japonica rice conferred a heat-sensitive phe- 
notype, which is related to the distribution 
of temperate japonica rice in temperate and 
cooler regions (23). By contrast, TT3°?* 
(TT3.10*/T 73.2) from African rice con- 
tributed a heat-tolerant phenotype, which is 
related to the distribution of African rice in 
tropical regions (24). Therefore, we speculated 
that TT3 was selected as an adaptation to dif- 
ferent temperature climates. The 773 natural 
allele or genetic editing of TT3.1/TT3.2 in rice 
enhanced heat tolerance and reduced loss of 
grain yield caused by heat stress. These genes 
are conserved in other major crops, such as 
maize and wheat, and are considered to be 
valuable resources for breeding highly heat 
stress-tolerant crops. 
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OIL POLLUTION 
Chronic oiling in global oceans 


Yanzhu Dong", Yongxue Liu’*, Chuanmin Hu, lan R. MacDonald‘, Yingcheng Lu® 


Ocean oil slicks can be attributed to natural seepages or to anthropogenic discharges. To date, the global 
picture of their distribution and relative natural and anthropogenic contributions remains unclear. Here, 

by analyzing 563,705 Sentinel-1 images from 2014-2019, we provide the first global map of oil slicks and 
a detailed inventory of static-and-persistent sources (natural seeps, platforms, and pipelines). About 90% 
of oil slicks were within 160 kilometers of shorelines, with 21 high-density slick belts coinciding well with 
shipping routes. Quantified by slick area, the proportion of anthropogenic discharges was an order of 
magnitude greater than natural seepages (94 versus 6%), in contrast to the previous estimate quantified by 


volume during 1990-1999 (54 versus 46%). Our findings reveal that the present-day anthropogenic 
contribution to marine oil pollution may have been substantially underestimated. 


cean oil slicks are microscopically thin 

and highly transient floating layers of 

liquid hydrocarbons (7). They can seep 

naturally from seafloor hydrocarbon res- 

ervoirs or be intentionally or acciden- 
tally discharged from ships, offshore oil/gas 
infrastructures, and land-based sources (2, 3) 
(Fig. 1). Oil pollution has been a global con- 
cern because of its adverse impacts on marine 
biology (4, 5) and ecology (6, 7). The indirect 
consequences, such as the toxicity along the 
food chain (4) and the loss of ecological di- 
versity (8-10), will persist for a long time and 
further weaken regional sustainable devel- 
opment capacity (5, 17). A comprehensive 
investigation of the global oil slick distribu- 
tion can benefit studies analyzing the environ- 
mental and socioeconomic effects and practices 
that provide crucial knowledge for related 
authorities and multiple stakeholders to pre- 
dict trajectories of oil slicks and further 
implement more effective and informed regu- 
lations (72, 13). 

Establishing a better understanding of oil 
slicks on a global scale, including their extent, 
hotspots, and natural and anthropogenic con- 
tributions, is usually compromised because 
of the extensive areas and tremendous data 
involved. It is also challenging to locate oil 
slicks because of their ephemeral and errant 
nature: Their advection and evolution heavily 
depend on winds, currents, waves, and other 
sea conditions (74). It is especially acute for 
accidental human-induced oil slicks occurring 
at random positions: Only a tiny percentage of 


1School of Geography and Ocean Science, Nanjing University, 
Nanjing 210023, China. *Collaborative Innovation Center of 
South China Sea Studies, Nanjing University, Nanjing 
210093, China. “College of Marine Science, University of 
South Florida, St. Petersburg, FL 33701, USA. “Department 
of Earth, Ocean, and Atmospheric Science, Florida State 
University, Tallahassee, FL 32306, USA. *International 
Institute for Earth System Sciences, Nanjing University, 
Nanjing 210023, China. 

*Corresponding author. Email: yongxue@nju.edu.cn 


Dong et al., Science 376, 1300-1304 (2022) 


these have been recorded, and even fewer cases 
have come into the sight of research. Likewise, 
previous estimates of natural seep contribu- 
tions were variable, from ~10% in the 1980s 
(15) to ~46% in 1990-1999 (16, 17), and dubious— 
for example, the global extrapolation was 
based on limited known cases (J6). In the 
context of the increasing human impact on 
the oceans in the present time (78), a sound 
baseline of global oil slicks will be meaningful 
for protecting the ocean we need for the future 
we want. 

Synthetic aperture radar (SAR) offers an ef- 
fective means with which to acquire spatially 
explicit information about oil slicks over a 
large area, and this method has been widely 
adopted in oil spill monitoring (/, 2). From 
all offshore Sentinel-1A/1B SAR images of 
2014-2019 (7 = 563,705 images), we identified 
452,057 locations of oil slicks and extracted 
their extents (see the materials and methods: 


im This study, 2014-2019 
(by area) 

mm NRC, 1990-1999 
(by volume) 


Land-based/ship/other sources 
91.65% 


50.01% 
SSeS 


Terrestrial discharges 
Coastal facilities 


Vessel discharges 


SAR data, oil slick extraction; figs. S1 to S3). 
We detected a cumulative oil slick area of 
1.51 x 10° km? (more than twice the area of 
France) in global oceans and established an 
extensive global oil slick dataset that enables 
an up-to-date understanding of their distribu- 
tions and sources. This information allowed 
us to address the following three questions: 
Where and at what magnitude did oil slicks 
occur globally? Where and how frequently 
did the static and persistent sources seep or 
leak oil? What proportions did natural seeps 
and human activities contribute to the total 
slick area on regional and global scales? 

In answering the first question, we used 
three metrics, distance, zonal, and gridded 
statistics (see the materials and methods: oil 
slicks spatial analysis), to demonstrate the 
spatial pattern of the global oil slicks. Our 
analysis showed that the spatial distribution 
of oil slicks was exceptionally uneven in the 
following ways. 

Oil slicks were primarily distributed along 
the coasts. Approximately 50% of the cumu- 
lative oil slick area was observed within 38 km 
from the coastline, with the peak at 7 km, 
and ~90% of the slicks were within 160 km 
(Fig. 2A). When oil slick areas were normal- 
ized by the SAR observation areas (purple 
shadows in Fig. 2A), the slick areas decreased 
overall with greater distance from the coast. 
Patterns of source distribution also became 
clear. The greatest magnitude of platform slicks 
occured between 20 and 40 km from the coast, 
whereas natural oil seeps were prevalent within 
90 km. These peaks reflect the high frequency 
of present-day offshore production and seep- 
age zones from the outer continental slope (2). 

The area of oil slicks varied considerably 
among different coasts and seas. For the 31 
offshore regions (Fig. 2B and fig. S4), >30% of 
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Fig. 1. Oil slicks from different sources entering into the marine environment. The orange bars in the 
legend refer to the contributions of natural and anthropogenic sources during 2014-2019 derived in this 
study, and the green bars are those of 1990-1999 estimated by the NRC. 
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Fig. 2. Oil slick statistics (2014-2019). (A) Oil slick area histogram by their distances to the coasts (x-axis step: 1 km, left y-axis: oil slick areas from all sources, right 
cyan y-axis: oil slick areas from natural seeps and platform sources). (B) Cumulative and normalized oil slick areas 
areas from all available SAR images, and “normalized” means oil slick areas per full-coverage observation. (€) Proportions of the four oil slick sources in these regions. 


all oil slicks were found in the Java Sea and 
surroundings (1.69 x 10° km”), the Mediterra- 
nean Sea (1.54 x 10° km”), and the South China 
Sea and surroundings (1.33 x 10° km). After 
normalizing oil slick area per region (see the 
materials and methods: oil slick spatial anal- 
ysis), the top three regions were the Java Sea, 
the South China Sea, and the Gulf of Guinea 
and surroundings (1.03 x 10, 9.94 x 10”, and 
5.68 x 10” km? per full-coverage observation, 
respectively). For the Gulf of Mexico, we de- 
tected an oil slick area of 367.19 km? per full- 
coverage observation during 2014-2019, ~50% 
less than previous estimates (2), although re- 
gions of persistent oil slicks observed were 
identical. The difference probably results 
from the higher spatial resolutions of the 
SAR data used in this study (predominantly 
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10 m for Sentinel-1 versus the previous 100 
to 400 m). 

Oil slick footprints showed hotspots glob- 
ally. Although oil slicks were found widely, the 
hotspots were primarily located in natural seep 
zones (such as the Caspian Sea and the Black 
Sea) and offshore oil and gas development 
areas (such as the coast of California and the 
Persian Gulf) (Fig. 3A and fig. $5). Addition- 
ally, in the gridded map (0.1° x 0.1° cell size) of 
oil slicks, 21 distinct high-density belts were 
visually found to coincide well with shipping 
routes, even though oil slicks may drift under 
winds and currents. These included 10 prom- 
inent oil pollution belts in the Mediterranean 
Sea and the Black Sea (Fig. 3B) and 11 other 
ship-induced oil pollution belts in the Strait 
of Hormuz, the Bay of Bengal, the Strait of 
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in the 31 regions. Here, “cumulative area” means oil slick 


Malacca, the Java Sea, and the Yellow Sea 
(Fig. 3, C to G). Outside of these belts, we 
also recognized 82,982 linear oil slicks that 
were likely from ship discharges, and these 
slicks and the belts together account for at least 
19.8% of the global total. Although the Interna- 
tional Convention for the Prevention of Pollu- 
tion from Ships (MARPOL 73/78) has been in 
place since 1983 (19), the substantial contribu- 
tion of the ship discharges still suggests the 
urgent need for more effective international 
cooperation and informed regulation. 

The second question we wanted to address 
is what are the distribution and occurrence 
rates of oil slicks from static and persistent 
sources? The global oil slick footprints during 
2014-2019 revealed 586 aggregation centers 


with recurrent oil slicks (Fig. 4A and table S1). 
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Fig. 3. Global distribution of oil slicks (2014-2019). (A) Normalized area of oil slicks in a 0.1° x 0.1° grid 
representing oil slick area per full-coverage SAR observation. (B to G) Enlarged views of areas marked by white 
dashed boxes in (A). High-density belts caused by oil pollution from ships can be observed, and their shape 
coincides well with the shipping routes over these areas. Color bar is the same for (A) and (C) to (G). 


Among them, 14 were related to offshore aqua- 
culture net cages that produced biogenic slicks 
(fig. S6) and therefore were excluded from 
further analysis. 

Natural seepages from seafloor hydrocarbon 
reservoirs have long been regarded as a pri- 
mary source of oil slicks (17, 20), and some hot- 
spots have been analyzed in detail (2, 21, 22). 
Here, a total of 435 aggregation centers were 
recognized as natural seep centers (NSCs) (see 
the materials and methods: oil slick classifica- 
tion). These NSCs were located within 3.36 to 
363.57 km of the shorelines, with water depths 
ranging from 4 to 5526 m. The Gulf of Mexico 
had >40% of NSCs (with a total of 176 ag- 


Dong e¢ al., Science 376, 1300-1304 (2022) 


gregation centers found) (fig. S7), followed by 
the Gulf of Guinea (62 centers), the Caspian 
Sea (43 centers), the coasts of Ecuador and 
Peru (31 centers), and other areas (figs. S8 to 
S18). The oil slick detection rate of NSCs varied 
greatly even in an identical region (Fig. 4, B 
to L), suggesting complicated controlling fac- 
tors among natural seeps. On the basis of re- 
current detection over time, we discovered 
that the most active natural seepage rates 
were in the western Caspian Sea (61.4%) 
and at the entrance of the Red Sea (59.3%), 
rather than the Gulf of Mexico (58.6%) or the 
coast of California (53. 5%), as previously re- 
ported (16, 23). 
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Compared with the last global natural seep- 
age map compiled in the 1970s (containing a 
total of 190 offshore seepage sites) (20), we dis- 
covered 245 additional NSCs in global oceans 
(fig. SI9A). Approximately 25% (48 sites) of the 
previously recorded seepage sites were not 
identified in our results, mainly in the coastal 
waters of Alaska, Canada, Argentina, India, 
Myanmar, Russia, Australia, New Zealand, 
and the Mozambique Channel. However, con- 
sidering that the minimum SAR observation 
over these areas was 138 times, we suspect that 
the oil seepage rates of these sites may be too 
intermittent to exhibit recognizable aggre- 
gation patterns (fig. S20). Another possible 
reason is that these seepages may have been 
misinterpreted because of technical limita- 
tions or local environment peculiarities. For 
example, the 30-year investigation over off- 
shore basins of Australia showed a few natural 
hydrocarbon seeps (24) instead of the five sites 
reported previously (20). Furthermore, taking 
the Gulf of Mexico and the Lower Congo Basin 
as examples, we found that ~80% of the seep- 
age locations documented in the previous 
studies (2, 27) coincide well with our results 
(fig. S19, B and C). 

Oil slicks from the expanding offshore oil 
and gas development activities (25) have been 
deemed to occur accidentally, and research at- 
tention has largely been limited to reported 
cases or those around damaged or misoper- 
ated platforms (26-28). However, our time- 
series analyses revealed frequent oil leakages 
and discharges from offshore infrastructures 
worldwide. We found that 111 oil slick aggre- 
gation centers (recognized as platform slick 
centers, PSCs) were closely related to energy 
production platforms (figs. S21 to S27), in- 
cluding one decommissioned platform (fig. 
$28). The North Sea and the Norwegian Sea 
contributed nearly a fourth (34 centers) of 
PSCs, followed by the Gulf of Guinea (33 cen- 
ters), South China Sea and Java Sea (15 centers), 
and other regions (Fig. 4A and figs. S7 to S18). 
The detection rates of PSCs in the North Sea 
and Norwegian Sea were low to moderate, 
varying from 0.7% to 11.8%, with a mean 
value of 4.4% (fig. S14). For comparison, 
the PSCs in the Gulf of Guinea exhibited a 
higher mean detection rate of 17.4%, and the 
highest rate (in the Lower Congo Basin) 
reached 69.3% (fig. S8). We suggest that the 
numerous PSCs and the alarming oil slick 
detection rates in some areas reflect ineffec- 
tual or absent marine environmental supervi- 
sion, where time-series satellite monitoring 
could play a vital role. 

Additionally, 26 aggregation centers were 
found to be close (mean distance 0.98 km) to 
pipelines (fig. S29) and were therefore clas- 
sified as pipeline slick centers (PiSCs). Note 
that in the northern-central Gulf of Mexico, 
18 PiSCs were not far from the known natural 
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Fig. 4. Global oil slick aggregation centers. (A) Distribution of NSCs, PSCs, PiSCs, and APCs. (B to L) More 
detailed views in selected regions. The numbers in the brackets represent the range of detection rates. 


seep locations (mean distance 14.23 km) and 
need further investigation to confirm whether 
they resulted from pipelines or from natural 
seeps (see the materials and methods: limit- 
ations and uncertainties). Regardless, oil slicks 
from all PiSCs accounted for only 0.5% of the 
global total. 

To address the third question of what pro- 
portions natural and anthropogenic sources 
contributed to the global total slick area, we 
classified oil slick origins into four categories: 
natural seeps, pipelines, platforms, and other 
anthropogenic sources (ships and land-based 
discharges). On the global scale, their propor- 
tions were 6.2%, 0.5%, 1.6%, and 91.7%, re- 
spectively (Fig. 1). At the regional scale, their 
proportions varied. For example, natural seeps 
accounted for 43.3% in the Gulf of Mexico, 
35.1% off the coast of Ecuador and Peru, and 
26.2% off the coast of California (Fig. 2C). 
Nevertheless, oil slicks from anthropogenic 
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sources dominate global oil slicks at present. 
Considering each region’s diverse area and 
observation area, we found that the weighted 
proportion of anthropogenic sources exceeds 
98% (see the materials and methods: oil slick 
classification; fig. S30). Additionally, mas- 
sive oil spills such as the 2010 Deepwater 
Horizon oil spill would release much more 
oil than other spills. No such oil spill disasters 
occurred during 2014-2019; otherwise, they 
would result in a higher proportion of anthro- 
pogenic sources. 

According to the National Research Coun- 
cil (NRC) estimate of 1990-1999 (16), natural 
and anthropogenic sources contributed 46.15% 
and 53.85% to global oil discharge volume, re- 
spectively. Although we discovered more than 
twice the number of natural seepage sites than 
that reported previously, their contribution 
(6.2% in this study) to the global oil slick area 
is still an order of magnitude lower than the 
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NRC estimate (46.2%). We attribute this to 
the improved global data and methodology 
and to the following. First, the NRC estimates 
were based on several assumptions and glob- 
al extrapolation, and the three major inputs 
(natural seeps, land-based, and operational 
discharges from vessels) had especially high 
levels of uncertainties (76). Second, there 
were increased discharges from anthropo- 
genic sources in recent years. Our statistics 
demonstrate that the area of high-confidence 
oil slicks from ship discharges was at least 
three times that of natural oil slicks. We at- 
tribute this to the expansion of world mari- 
time transportation, the carrying capacity of 
which has risen by 2.5 times from 2000 to 
2019 (fig. S31). Additionally, oil contamination 
was particularly noteworthy from the coasts of 
emerging economies (including China, Vietnam, 
Indonesia, Malaysia, and Brazil), which have 
contributed ~25% of the global total economic 
growth during the past two decades (fig. S32). 
Thus, we argue that the adverse effects of the 
marine ecosystem’s chronic exposure to human- 
induced oil slicks may have been substantially 
underestimated. Indeed, exposure to polycyclic 
aromatic hydrocarbons in concentrations of 
1 to 10 ug/liter has been shown to impair 
cardiac development in larval fishes (70); even 
minor releases of oil may have immediate im- 
pacts on marine biota (5). In addition, given 
the nature of oil slicks and SAR data coverage, 
as well as the various factors influencing their 
detection accuracies (such as unfavorable wind 
conditions and detection limit of oil slick size; 
see the materials and methods: limitations and 
uncertainties), the estimate of anthropogenic 
contribution in this study is likely to be con- 
servative. Therefore, the distribution and mag- 
nitude of oil slicks from anthropogenic origins, 
as well as their potential effects on marine 
ecosystem health, require comprehensive mo- 
nitoring and research. 

Overall, we have established a comprehen- 
sive and up-to-date understanding of global 
oil slicks based on all offshore Sentinel-1 SAR 
images acquired from 2014 to 2019. Our analy- 
sis reveals the highly concentrated distribution 
of oil slicks in nearshore waters and global 
hotspots of oil contaminations. Regulation 
and control of anthropogenic oil slicks are 
challenging because in many cases we cannot 
identify the specific pollution event. We also 
compiled a global inventory of static and per- 
sistent sources (both natural and anthropoge- 
nic), including their exact locations and oil 
slick detection rates, which could then be used 
to help guide future offshore oil and gas ex- 
ploration and pollution management. The de- 
rived oil slick dataset can provide a training 
set for computer-based deep-learning tech- 
niques requiring big data to summarize the 
paradigm and automatically extract oil slicks. 
Finally, our results reveal the overwhelming 
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contributions of anthropogenic oil slicks com- 
pared with those from natural seeps that are 
probably due to increased human activities. 
Under a changing climate and projected in- 
creasing anthropogenic stresses on the health 
of the ocean ecosystem, the findings of this 
study will serve as a scientific and normative 
basis for achieving the Sustainable Develop- 
ment Goal 14 of the United Nations: to con- 
serve and sustainably use the oceans, seas, and 
marine resources for sustainable development. 
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Spin-charge separation in a one-dimensional Fermi 
gas with tunable interactions 


Ruwan Senaratne’+, Danyel Cavazos-Cavazos'}, Sheng Wang”, Feng He°“, Ya-Ting Chang’, 
Aashish Kafle?, Han Pu, Xi-Wen Guan?>*, Randall G. Hulet!* 


Ultracold atoms confined to periodic potentials have proven to be a powerful tool for quantum 
simulation of complex many-body systems. We confine fermions to one dimension to realize the 
Tomonaga-Luttinger liquid model, which describes the highly collective nature of their low-energy 
excitations. We use Bragg spectroscopy to directly excite either the spin or charge waves for 
various strengths of repulsive interaction. We observe that the velocity of the spin and charge 
excitations shift in opposite directions with increasing interaction, a hallmark of spin-charge 
separation. The excitation spectra are in quantitative agreement with the exact solution of the 
Yang-Gaudin model and the Tomonaga-Luttinger liquid theory. Furthermore, we identify effects of 
nonlinear corrections to this theory that arise from band curvature and back-scattering. 


nlike three-dimensional (3D) metals 

whose low-energy excitations are fer- 

mionic quasiparticles, the low-energy 

excitations of 1D fermions are collec- 

tive bosonic spin- and charge-density 
waves (SDWs and CDWs) that disperse lin- 
early, as described by the Tomonaga-Luttinger 
liquid (TLL) theory (/-5). Notably, the SDWs 
and the CDWs of an interacting 1D Fermi gas 
propagate at different speeds, causing a spatial 
separation of the spin and charge excitations 
in the gas. 

Spin-charge separation has been studied 
in quasi-1D solid-state materials in several 
ground-breaking experiments that used either 
momentum-resolved tunneling to determine 
the dispersions (6-8) or angle-resolved photo- 
emission spectroscopy (9-12). Although these 
experiments observe splitting into spin and 
charge excitations, a quantitative analysis of 
these data has proved challenging because of 
the complexity of the electronic structure and 
the unavoidable presence of impurities and 
defects. Recently, a series of experiments with 
ultracold atoms in an optical lattice was per- 
formed on a single-site resolved 1D Hubbard 
chain, leading to the observation of the frac- 
tionalization of spin and charge quantum 
numbers at equilibrium (72), the modification 
of the SDW wave vector by density doping and 
by spin polarization (73), and the study of sim- 
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ultaneous spin and charge dynamics outside 
the Luttinger liquid regime that result from a 
deconfinement-induced quench (J4). These ex- 
periments demonstrated the ability to perform 
quantum simulation of interacting fermions in 
one dimension with unprecedented control in 
cold-atom systems, but they did not measure 
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the collective low-energy excitation spectrum 
inherent to spin-charge separation. 

The excitation spectrum of the charge (den- 
sity) mode of fermionic atoms confined to 
quasi-1D tubes has been previously measured 
for fixed (75) and variable interaction strengths 
(16). These experiments used two-photon stim- 
ulated Bragg spectroscopy (Fig. 1, A and B) to 
impart an observable momentum fig, with 
energy fw, while keeping the internal state 
of the atom unchanged (17-20) (f is the re- 
duced Planck’s constant). The response of the 
1D gas at a particular g and w is related to the 
dynamic structure factor (DSF) S(q, ), which 
characterizes the low-energy excitation spec- 
trum for g<kp, where ky is the Fermi wave 
vector. In our previous work the charge-mode 
structure factor S,(g, m) was measured and 
quantitatively compared with theory with good 
results (16). Measurement of the spin-wave 
spectrum S,(g, w) remained out of reach be- 
cause, without appropriate modifications, such 
a measurement induces single-photon scatter- 
ing events that produce substantial atom loss. 

We have developed improvements to our 
implementation of Bragg spectroscopy to re- 
duce spontaneous scattering to an acceptable 
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Fig. 1. Spin and charge excitations from Bragg spectroscopy. (A) Partial energy-level diagram of °Li 
showing relevant transitions and laser detunings for spin (Ag, violet) and charge (A, red) excitations. 
(B) Relative orientation (6,.,) of each Bragg beam (1 and 2) with respect to the axis perpendicular to 
the 1D tube direction. A momentum transfer q = ky - k> =(.2 ke for the central tubes is delivered to the 
sample for a given relative detuning m = w — 2 between the beams. (C and D) Diagram of the charge 
and spin excitations, showing an excitation of (C) a particle-hole pair and (D) a spinon pair. The effect on 
the total density p(z) and spin density o(z) is shown for each case at the bottom. The excitations are 
depicted, for clarity, as starting from a classical zero-temperature antiferromagnetic ground state in the 


strongly repulsive regime. 
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level while selectively exciting either the SDW 
or the CDW with tunable repulsive interac- 
tions. The measurements are compared with 
the TLL theory, which describes the low- 
energy excitations of the more general Yang- 
Gaudin model of a spin-1/2, 1D Fermi gas with 
repulsive delta-function interactions in the 
continuum limit (21, 22). They provide a 
quantitative test of spin-charge separation 
in the Luttinger liquid regime. Additionally, 
we quantitatively show that our results for 
S.(q, @) provide evidence for deviations from 
the linear TLL theory caused by low-energy 
back-scattering interactions, which are usu- 
ally neglected to obtain a linear spin-mode 
dispersion (5). 

Bragg spectroscopy is well suited to study- 
ing spin-charge separation because the CDW 
or SDW may be isolated by the choice of de- 
tuning of the Bragg beams from resonance 
with an electronic excited state (Fig. 1). The 
detuning determines the sign of the light-shift 
potential, which can be used to create a sym- 
metrical light shift that exclusively excites 
charge waves or an asymmetrical potential 
that only excites spin waves. For a system 
composed of a balanced mixture of two spin 
components (ft, |) we can identify two inde- 
pendent contributions to the DSF, S;; and Sy, 
and thus define a charge- and spin-density 
DSF given by (5) 


Ses(q,@) = 2[S4(g,) + S1,(g, 0)] 


where the “+” sign corresponds to charge and 
the “—” sign corresponds to spin. At zero tem- 
perature, the momentum transfer to the sys- 
tem from the Bragg beams is given by (20, 23) 


1 1 2 
P(q, @) & { Sy 4 Ss 
(q,®) (3 ;| tt AVA, tt 


where A, is the relative detuning of the Bragg 
beam from the excited state with respect to 
each ground spin state o. If the condition 
A; = A, > Ay, is satisfied, where A, is the 
splitting of the spin states, then P(g, w) ~< 
S.(q, ©) and a CDW is excited, as depicted in 
Fig. 1C. On the other hand, if A; = —A, = 
|A;,|/2, then P(q, @) o S.(g, @) and an SDW is 
excited, as depicted in Fig. 1D. The detuning 
required for measuring S,(q, w) is thus fixed by 
A;,, unlike in the case for measuring S,(q, ), 
where, in principle, the detuning may be ar- 
bitrarily large. In the finite temperature case, 
a reverse Bragg process must also be con- 
sidered, for which the momentum transfer is 
modified as P(g, w) & S(g, w) — S(-q, -@) = 
S(q, w)[1 - exp(-ho/kgT)] (24, 25), where kz 
is the Boltzmann constant. 

To reduce spontaneous scattering during 
the Bragg measurement, the ratio of A,, tol, 
the linewidth of the transition, must be in- 
creased. We approximately doubled A;, by 
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choosing |1) and |3) as our pseudo-spin-1/2 
states, rather than |1) and |2) as used previous- 
ly (states |1), |2), and |3) are the three lowest 
hyperfine states of °Li) (16, 19). For the ex- 
citation of the SDW, we took the additional 
step of reducing I’ by detuning the Bragg beams 
from the 3P3,. excited state at a wavelength of 
323 nm, rather than the usual 671-nm transi- 
tion to the 2P;,. state that we use to excite the 
CDW (see Fig. 1A). The spontaneous-decay 
linewidth of the ultraviolet transition is nearly 
eight times smaller than that for the red tran- 
sition (26). We compensate for the difference in 
wavelength by simply adjusting the angle be- 
tween the Bragg beams @, ~ 4.5° and; ~ 2.2°; 
see Fig. 1B), such that for both cases, the Bragg 
wave vector is parallel to the tube axis and has 
a magnitude |g| = 1.47 um’, corresponding 
to 0.2k» for a peak-occupancy tube. Thus, the 
net effect of this change is to further reduce 
the rate of incoherent scattering. The combi- 
nation of these two steps reduces the sponta- 
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neous scattering by more than a 10-fold factor 
for a given Bragg coupling, as compared with 
(16), and is sufficient to measure S,(q, ©). 

A more detailed description of our exper- 
imental methods may be found in the sup- 
plementary materials (27). We prepare a 
spin-balanced mixture of °Li atoms in the two 
energetically lowest hyperfine sublevels, states 
|1) and |2), and confine them in an isotropic 
optical trap. We evaporatively cool the atoms 
to a temperature T ~ 0.17;, where 7p is the 
Fermi temperature. We create an effectively 
1D system that realizes the Yang-Gaudin model 
by loading the atoms into a 2D optical lattice 
with depth of 15£,, where E, is the recoil energy 
of a lattice photon of wavelength 1.064 um. 
The resulting trap configuration is an array 
of quasi-1D tubes that are elongated in the 
axial dimension with an aspect ratio of ~170. 

The number of atoms per tube is nonuni- 
form across the ensemble of tubes because of 
the Gaussian curvature of the optical beams. 


Fig. 2. Bragg spectra. Normalized 
Bragg signals related to S,(q, w) (red 
triangles) and S;(q, @) (blue circles) 


aldo 


500 F 
for the range of 3D scattering length a 
from 0 to 500dp. Each data point is the 
400 average of at least 20 separate exper- 


imental shots. Error bars represent 
standard error, obtained by bootstrap- 
ping (40). Vertical dashed lines show 
the extracted peak frequency ap, for 
the noninteracting case (black) and the 
strongest probed interactions for the 
spin and charge modes (blue and red, 
respectively). Solid lines are the 
calculated Bragg spectra for a global 
temperature T = 250 nK with no 
additional fitting parameters other than 
overall scaling. Theory includes the 
nonlinear effects of band curvature in 
the charge mode and back-scattering 
in the spin mode (for linear theory, see 
figs. S7 and S8). Deviations from 
theory at high frequency may be due 
to unaccounted-for corrections of 
order q°. 
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Fig. 3. Spin-charge separation. y 
Peaks of measured Bragg 
spectra for charge (red trian- 
gles) and spin (blue circles) 
configurations for a ranging 
from 0 to 500d. Peak fre- 
quency values were determined 
through fits of a parabolic 
function to the data points 
above 50% of the maximum 
measured value, and error bars 
are statistical standard errors 
of the relevant fit parameters. 
The corresponding speed of 
sound Vp = @p)/q is given by the 
right axis. The upper horizontal 
axis gives the interaction 
strength in terms of the Lieb- 
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Liniger parameter y* evaluated at the center of a tube with an occupancy of 30 atoms. Lines show the 
calculated values for w, for the charge and spin modes (dash-dotted red and dashed blue lines, respectively). 
Symbols for a = 0 and 100d have been slightly displaced horizontally from one another for clarity. 
Nonmonotonicity in the charge-mode data and theory at low interaction is caused by small residual 
differences in the number profiles prepared at different interaction strengths. Nonmonotonricity in the spin-mode 


theory is likely a consequence of neglecting the effects of band curvature, a q° correction. 
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Fig. 4. Dispersion of SDWs and CDWs. (A and B) The 1/e? axial width of outcoupled atoms after a Bragg 
pulse and 150-us time of flight for (A) charge (do, red triangles) and (B) spin (ds, blue circles) excitations, 
with a ranging from O to 500ap. The widths are the Gaussian fits to the positive outcoupled signal at ap. 
Error bars are standard errors determined by bootstrapping for at least 20 independent images (40). The 
horizontal axis gives the Lieb-Liniger parameter y* calculated for a median tube occupancy of 30 atoms 
(27). The blue dashed line shows an estimation for d, derived from the finite lifetime of the spin bosons. 
(C to F) Representative samples of column-density (p,) images of the atom cloud after the Bragg pulse and 
time of flight. Shown are the charge-mode excitations with (C) a = 0 and (D) a = 500ap, and the spin-mode 
excitations with (E) a@ = 0 and (F) a = 500ap. Each frame corresponds to 40 um by 65 um. 
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The number distribution from tube to tube is 
also dependent on interaction strength; the 
central tube occupancy is highest for a non- 
interacting gas and decreases as repulsive 
interactions increase. We actively control for 
these variations by applying a focused repul- 
sive green (532 nm) laser beam during the 
lattice ramp up along each of the three or- 
thogonal axes (28). Varying the depth of this 
harmonic antitrapping potential allows us to 
adjust the amount of confinement produced 
by the optical lattice and thus make the den- 
sity profiles comparable between different in- 
teraction strengths while keeping the total 
atom number constant. We measure the tube 
occupancy by taking in situ phase-contrast 
images of the atom cloud (29) and perform- 
ing an inverse Abel transform to obtain the 
3D distribution. A typical ensemble consists 
of a total of 6.5 x 10* atoms, has a peak tube 
occupancy of ~50 atoms, and has a most 
probable tube occupancy of ~30 atoms. 

We perform Bragg spectroscopy by apply- 
ing the pair of Bragg beams on the atoms in a 
200-us pulse. The intensity per beam is fixed 
to limit the loss of atoms caused by spontane- 
ous scattering to 6 to 8% during the spin-mode 
measurement and to ensure that the momen- 
tum transfer is in the linear-response regime 
for either mode over the entire range of in- 
teraction strengths that we study (27). There is 
no discernible atom loss during the charge- 
mode measurement. Immediately after the 
Bragg pulse, the atoms are released from the 
lattice and are imaged using phase-contrast 
imaging after 150 us of time of flight, after 
which atoms receiving the Bragg kick are 
noticeably displaced from the center of the 
cloud (fig. Sl). We define the Bragg signal 
to be proportional to the number of out- 
coupled atoms. 

The interaction strength is readily tunable 
using the Feshbach resonance between states 
|1) and |3) located at 690 G (30). The 3D scat- 
tering length a may be tuned between a = 0 
and a = 500dp, where Qp is the Bohr radius, 
without appreciable atom loss. Figure 2 shows 
the measured (symbols) and calculated (solid 
lines) Bragg spectra for both modes in the range 
of a from 0 to 500d. Our DSF calculations take 
into account the effect of the inhomogeneous 
density that stems from the harmonic con- 
finement along each tube by use of the local 
density approximation (LDA). The strength 
of interactions is density dependent and is 
given by the dimensionless Lieb-Liniger pa- 
rameter y = mg (a)/h’ yp, where g;(q) is the 
coupling strength of the quasi-1D pseudopo- 
tential (37) and m is the atomic mass. The 
local density pyp determines the local Fermi 
velocity and momentum (v; and hkg), the 
Luttinger parameters (K,), and the local 
velocities of the charge and spin waves (v, 
and v,, respectively). Because the Bragg signal 
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is proportional to the total transferred mo- 
mentum, we sum up the local values of the 
DSF along each tube, by invoking the LDA, 
to obtain the calculated spectra. Finally, we 
account for the frequency broadening caused 
by the finite duration of the Bragg pulses. A 
global temperature of 250 nK is the only free 
parameter in this model, other than an inde- 
pendent normalization of each calculated and 
measured spectrum. 

To calculate the charge and spin DSFs, we 
use the exact Bethe ansatz solution of the Yang- 
Gaudin model at zero temperature (2/, 32). 
For |g| < Xp, the low-energy charge and spin 
excitations have approximately linear disper- 
sion and, in this approximation, S.5(q,@)%< 
lg| 3(@ — Ves) . However, at finite tempera- 
ture and q, these DSFs are broadened when 
nonlinear effects are considered (33). For the 
strength of interactions probed in this experi- 
ment and for T « Ty, S.(g, @) is well approx- 
imated by the noninteracting DSF because the 
latter also exhibits a particle-hole excitation 
spectrum with width °<q’. As in (16), the effect 
of interactions is accounted for by replacing 
ky with k, = m*v,/h, where m* is the effective 
mass. The leading correction to the spin-mode 
dispersion at finite temperature is a conse- 
quence of a low-energy back-scattering pro- 
cess that is expected for contact interactions 
in 1D (34). Here, distinguishable spins per- 
mute between the two Fermi points by ex- 
changing 2k,. This process is exclusive to 
the spin sector and disrupts the linearization 
of the spin dispersion in the bosonization 
approach of TLL theory (5, 32). We obtain 
the retarded spin-spin correlation function 
at finite temperature from the dressed spin- 
boson propagator (35). By comparing our mea- 
surements to the nonlinear Luttinger liquid 
(NLL) theory, we find that accounting for 
nonlinearities caused by back-scattering is 
necessary to model the spin Bragg spectra, 
particularly for large interactions. The linear 
TLL model fails to reproduce the observed 
high-frequency tails of the spin-mode Bragg 
spectra (27). 

The frequency at which the Bragg signal 
reaches a maximum, @,, corresponds to the 
most probable value of the mode velocity, 
Up = @,/g, in the ensemble. We determine 
the peaks of each of the measured spectra by 
fitting a parabola to the data points that are 
above 50% of the maximum measured value 
for each spectrum. The locations of the peaks 
of the spectra obtained for our range of inter- 
action are shown in Fig. 3 along with the 
peaks of the calculated spectra for each mode, 
which are in excellent agreement. For the 
noninteracting gas, the spin and charge col- 
lective modes have the same speed, resulting 
in nearly identical measured spectra for the 
two cases (27). The congruence between the 
two spectra also confirms that the atom loss 
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suffered during the spin-mode measurement 
has no discernible effect on the measured Bragg 
spectrum. As the strength of the interaction 
is increased, the charge-mode velocity v, in- 
creases, whereas the spin-mode velocity v, 
decreases. This is seen in the shifts of the 
peaks of the two spectra: to a lower frequency 
for the spin mode, and to a higher frequency 
for the charge mode. 

We further explored the NLL regime by ex- 
tracting the axial width of the out-coupled 
atom packet after time-of-flight expansion, 
as shown in Fig. 4 as functions of interaction 
for both modes. As expected, the out-coupled 
widths increase with y for measurements of 
the spin mode, whereas they remain approx- 
imately constant for the charge mode. We are 
able to model the increase in the out-coupled 
width for the spin mode by calculating the 
spread in velocities implied by the finite spin- 
boson lifetime due to back-scattering (27). 

Having harnessed the tunability of inter- 
actions available in the cold-atom setting, we 
reveal the role of interactions in spin-charge 
separation by tuning between a spin-charge 
separated regime and one where there is no 
separation. Further, the selectivity of the Bragg 
process in exciting either the CDW or the SDW 
allows us to provide a clear demonstration of 
the division of the TLL Hamiltonian into dis- 
tinct spin and charge sectors. Bragg spectros- 
copy may be used to probe the ultracold-atom 
TLL beyond the demonstration of spin-charge 
separation contained in this work. Measure- 
ments with variable g can be conducted to 
further study the NLL and to benchmark cal- 
culations that include effects of band curva- 
ture and spin-charge coupling (33, 35, 36). 
Additionally, at increased temperatures and 
interactions, a spin-incoherent Luttinger liq- 
uid is expected, which supports a propagating 
charge mode but not a spin mode (37, 38). 
Spin-imbalanced mixtures and attractive in- 
teractions are also of interest and are accessi- 
ble by using this technique (39). Experiments 
with shallower lattices will allow for the study 
of dimensionality effects due to tunneling be- 
tween tubes (5). It is increasingly clear that the 
oft-admired mathematical elegance of 1D many- 
body physics is well complemented by the 
purity and tunability of ultracold atoms. 
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A photonic integrated circuit-based 
erbium-doped amplifier 


Yang Liu’+, Zheru Qiu?“t, Xinru Ji”, Anton Lukashchuk?”, Jijun He"?, Johann Riemensberger’~, 
Martin Hafermann, Rui Ning Wang??, Junqiu Liu’, Carsten Ronning’, Tobias J. Kippenberg*?* 


Erbium-doped fiber amplifiers revolutionized long-haul optical communications and laser technology. 
Erbium ions could provide a basis for efficient optical amplification in photonic integrated circuits but 
their use remains impractical as a result of insufficient output power. We demonstrate a photonic integrated 
circuit-based erbium amplifier reaching 145 milliwatts of output power and more than 30 decibels of small- 
signal gain—on par with commercial fiber amplifiers and surpassing state-of-the-art III-V heterogeneously 
integrated semiconductor amplifiers. We apply ion implantation to ultralow-loss silicon nitride (SizN,) photonic 
integrated circuits, which are able to increase the soliton microcomb output power by 100 times, achieving 
power requirements for low-noise photonic microwave generation and wavelength-division multiplexing optical 
communications. Endowing Si3N, photonic integrated circuits with gain enables the miniaturization of various 
fiber-based devices such as high—pulse-energy femtosecond mode-locked lasers. 


he invention of erbium-doped fiber 
amplifiers (EDFAs) in the 1980s (J, 2) 
revolutionized long-haul optical commu- 
nications, profoundly affecting the trans- 
mission of information in society. EDFAs 
have replaced complex and bandwidth-limited 
electrical repeaters, enabling transatlantic fiber- 
based optical communication networks (3). 
Erbium amplifiers have a number of properties 
highly suitable for optical communications, such 
as a broadband gain of ~1550 nm-wavelength 


Fig. 1. Integrated erbium-implanted 
SizN, waveguide amplifier. (A) A Si3N4 
waveguide amplifier with erbium ions 
optically excited by a 1480-nm pump. 
(B) Optical image of a 0.5-m-long Er: SisNq 
waveguide coil. (€) Schematics of key 
fabrication processes and SEM images of 
waveguide cross sections overlaid with 
the simulated optical transverse electric 
(TE) mode before and (D) after ion 
implantation. (E) Vertical profiles of 
calculated erbium concentration (green), 
simulated optical TE mode intensity (red), 
and measured concentration (blue) by 
Rutherford backscattering spectrometry 
(RBS) across the waveguide center. 

(F) Measured optical losses before ion 
implantation, as-implanted, and after 
annealing. (G) Optical image of an 
intensely pumped Er: Si3N4 chip 
butt-coupled with optical fibers. 


A 
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that coincides with the lowest optical fiber 
propagation loss band, a long millisecond (ms)- 
lifetime of the parity forbidden intra-4f shell 
“Ts/2—“hg/2 transition that leads to slow gain 
dynamics and negligible interchannel cross- 
talk in multiwavelength amplification, high- 
temperature stability, and a low-noise figure 
approaching the quantum mechanical limit 
of 3 decibels (dB) for phase-insensitive ampli- 
fication (4). Today, EDFAs have underpinned 


the development of narrow-linewidth and mode- 


locked lasers that are widely deployed in ap- 
plications such as coherent communications 
(3), interferometric sensing, and optical fre- 
quency metrology (5). Rare-earth ion doping 
(6, 7) can provide the basis for compact erbium- 
doped waveguide amplifiers (EDWAs) (8). In 
the 1990s efforts were made to implement 
EDW4As on the basis of oxide glass waveguides 
(9, 10), but these were hindered by large wave- 
guide background losses, large device foot- 
prints, and incompatibility with contemporary 
photonic integrated circuits, and thus were 
ultimately abandoned. Interest in EDWAs re- 
emerged with the SizsN, complementary metal- 
oxide-semiconductor-compatible photonic 
integrated circuit platform, with advantages 
over silicon including a wider transparency 
window (11), absence of two-photon absorp- 
tion in telecommunication bands, lower tem- 
perature sensitivity, high power handling of 
>10 watts (12), and most crucially, low propa- 
gation losses of <3 dB/m, maintained over 
meter-scale lengths (13). 
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One challenge in realizing photonic inte- 
grated circuit-based erbium amplifiers is the 
limited achievable gain stemming from con- 
straints in doping concentration due to co- 
operative upconversion (1/4). This limitation 
necessitates waveguides with low propagation 
losses and long waveguides with lengths rang- 
ing from tens of centimeters to meters, to 
achieve large gain and high output power. 
Although net gain has been shown, all prior 
work using erbium-doped Al,O03 (15, 16), TeOz 
(17), and erbium chloride silicate nanowires 
(18) typically achieved low output power of 
<<1 mW, below the level demanded by many 
applications and below that achieved with 
heterogeneous III-V integration in silicon 
photonics (19, 20). 

We demonstrate a photonic integrated 
circuit-based erbium-implanted Si3N, (Er: 
SigN,) amplifier based on meter scale ultralow- 
loss SigN4 waveguides (Fig. 1A). We fabricate 
densely-packed Si;N, spiral waveguides of 0.5m 
in length (Fig. 1B) with 3-um gap spacing 
and cross sections measuring 0.7 x 2.1 um”, 
achieving a compact footprint of only 1.2 x 
3.6 mm? and a propagation loss of <5 dB/m 


through the photonic damascene process (73) 
(Fig. 1C and fig. S1). We use devices without 
top cladding to enable direct ion implanta- 
tion into the waveguide core (Fig. 1D). Next, 
we apply ion implantation (6)—a wafer-scale 
process that benefits from much lower coopera- 
tive upconversion compared with cosputtered 
films (J4)—to the Si3N, integrated circuits 
(fig. S1). To achieve a large overlap of [ = 50% 
between embedded ions with the fundamental 
optical waveguide mode, we use three succes- 
sive implantation steps with ion energies of 
2.0, 1.416, and 0.966 mega-electron volts 
(MeV) and corresponding ion fluences of 
4.50 x 10”, 3.17 x 10", and 2.34 x 10% cm™”, 
respectively. The simulated concentration pro- 
file (Fig. 1E, green) with a maximum depth 
of 400 nm in SisN, (22) matches well with 
Rutherford backscattering spectrometry (RBS) 
measurement (fig. $2). 

Upon implantation the waveguide back- 
ground loss increases from <5 to 100 dB/m 
because of implantation defects (Fig. 1F). The 
background loss outside the erbium absorp- 
tion band is reduced after annealing at 1000°C 
in oxygen for 1 hour and approaches the same 


level as undoped waveguides (21). Green lumi- 
nescence is observed stemming from cooper- 
ative upconversion (Fig. 1G). Notably, such a 
loss only contributes <2.5 dB of background 
attenuation for a 0.5-m-long waveguide, ap- 
preciably lower than the 30 dB passive loss 
for waveguides of equal lengths in prior work 
(15, 16), which depletes the pump early and 
prevents efficient amplification. Moreover, 
we investigate selective masking (using a photo- 
resist) during ion implantation (figs. S4 and S5). 
Crucially, lateral claddings can provide me- 
chanical support to mitigate severe cross sec- 
tion deformations observed in the waveguides 
without lateral claddings (figs. S1 and S6). 
The emission and absorption cross sections 
[oe(A) and o,(A)] are important parameters 
for modelling gain performance and extract- 
ing cooperative upconversion coefficients. 
Figure 2A shows the emission spectrum when 
pumping the *1,;/. state (980 nm) in a 0.46-cm- 
long Er: SizN, waveguide, from which the 
emission cross section (Fig. 2C) can be derived 
Ge(A) 
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Fig. 2. Light emission and absorption properties of Er: Si3N, photonic wave- 
guides. (A) Fluorescence emission spectrum (red) from an Er: SisN4 waveguide. 
(B) Measured resonance linewidths of an Er: SisN4 microring resonator for extracting 
erbium absorption. The insets exemplify two resonances with intrinsic loss of 2 and 
0.08 dB/cm. (C) Extracted emission and absorption cross sections. (D) Experimental 
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is the photoluminescence lifetime fitted from 
temporal measurement (Fig. 2, D and E), as- 
suming the modification (~0.93) of the density 
of states has limited effects on the emission 
rate (27). The wavelength-dependent erbium 
absorption loss og; is derived from the intrinsic 
resonance linewidths of an Er: SizN, micro- 
ring resonator and is calibrated by optical 
frequency domain reflectometry measurements 
(fig. S3). The loss is proportional to the erbium 
absorption cross section given by oa(A) = — 
in which Np is the effective peak erbium ion 
concentration and T is the overlap factor (23). 
The absorption cross section 6, (A) is obtained 
by scaling its peak value to approximately 
that of the derived o¢(A), which in turn gives 
No = 1.35 x 107° em™?, indicating that al- 
most all the incorporated erbium ions are 
optically active compared with the RBS mea- 
surement (Fig. 1E). Comparisons of measured 
gain coefficients are made under different 
ion concentrations (Fig. 2F), pump wave- 
lengths, and waveguide widths (fig. S7). 
Next, we demonstrate high output power 
and large net gain upon 1480 nm pumping in 
a 0.21-meter-long Er: Si;N, waveguide with an 
erbium concentration of ~3.25 x 10° cm® 


(Fig. 3A), showing a broadband on-chip net 
gain of 30 dB (Fig. 3B). Figure 3C shows mea- 
sured net gain at 1550 nm along with simu- 
lations reproduced for parameter fitting. The 
on-chip output power reaches 60 mW for 
0.07 mW input, indicating an off-chip net gain 
of 24 dB considering ~5.8-dB coupling loss. 
The on-chip output power reaches 145 mW 
for an increased input power of 2.61 mW at 
245 mW coupled pump power (Fig. 3D, inset), 
indicating an on-chip power conversion effi- 
ciency approaching 60% (signal power incre- 
ment divided by pump power). Similar gain 
performance is achieved in a 0.5-meter-long 
Er: SizN4 waveguide with a lower erbium con- 
centration of ~1.35 x 107° cm® (figs. S9 and 
S10). A noise figure of ~7 dB is measured ata 
net gain of >20 dB (fig. S11) and the excess 
EDWA RIN approaches the same level (ca. 
-156 dBc/Hz) as the EDFA for >10 MHz 
offset Fourier frequency (fig. $12). Simula- 
tions suggest that the low cooperative upcon- 
version (Cx = 3.0 x 10 cm’s~! and 13.0 x 
10-18 cm?s"! fitted for low and high concen- 
trations, respectively) does not impose limita- 
tion on the EDWA gain performance at high 
pump power >245 mW (21). The observed 


fluctuations in measured gain are caused by 
the gain clamping effect when parasitic lasing 
occurs in an optical Fabry-Pérot cavity formed 
by waveguide facets (fig. S13). 

As an example of the utility, we apply the 
Er: SigN4 EDWA to the amplification of soli- 
ton microcombs (Fig. 4A) (24), which typi- 
cally exhibit low conversion (~1%) efficiency 
(25) and require amplification in virtually all 
applications. First, a 19.8-GHz single-soliton 
microcomb generated in a Si;N, Euler-bend 
racetrack microresonator (26) with 0.08 mW 
output power [Fig. 4B, panel (i)], is amplified 
to 8.4 mW of off-chip power by the EDWA 
[panel (ii)]. This leads to a reduction in single- 
sideband phase noise of a soliton repetition 
rate to ca. -144 dBe/Hz at >1 MHz offset Fourier 
frequency, compared with —-104 dBc/Hz (before 
amplification) limited by photon shot noise 
(Fig. 4C). Comb line spikes [panel (ii)] are 
induced by the regenerative feedback from 
EDWA chip facets. A commercial EDFA is 
deployed to amplify the same soliton micro- 
comb to a similar power level [panel (iii)], 
giving identical performance. 

Second, we demonstrate a telecommunica- 
tion experiment using an amplified soliton 


FRC a PM oOo 35 T T T T T T T T T 
+ ,, [Backward = 
1480 nm Pump 1 ba PM < 30 i i > shige laes z| 
FPC are 7 a A Ms 
Forward 1500 nm = ‘ e Le 
@® 25 é ofe e 
2 2 e 
FPC 2 ~ 
lll, Baa Coupl 20+ ¢ baa | 
PIGRSIESSne 4550/1480 nm 8 oe 
1520-1580 nm Jenga O45 i i ; i i \ \ f 1 
1530 1540 1550 1560 1570 1580 
Wavelength (nm) 
Cc TT TTT T D 160 T T 
30+ A Exp. w/ Sig,,: 2.61 mW See 
Sim. 140 ereetl 
@ = Exp. w/ Sig, : 0.31 mw = 
Sim. S 
20 i | | Exp. w/ Sig,,: 0.07 mW 120 oD e7 
Si = A 
im. 5 
=~ Ae a 
a ne] 
= 2 100 + & 4 
££ 105 = 5- e 
© Off-chip net gain ro 1 A 
[O) = [o) 
2 ; & 80 | 
= Coupling loss < ms e 
ST We background | S 7 
is) ackground loss se 4 
¢ e < 60 1530 1540 1550 1560 1570 e. 
e) Erbium absorption 12) Wavelength (nm) A 
= . 
es e 4 
10 40 A Exp. w Sig, : 2.61 mw A 
@ Exp. w/ Sig,: 0.31 mw aA®%u 
20 m Exp. w/ Sig,: 0.07 mw iz A e. if 
-20 ‘ ’ ‘ 4 eF 
aren! purr \ 0 a t-O16-6-4- A 4 , \ 
107 10° 10' 10? e 10' 10? 


On-Chip Pump Power (mW) 


Fig. 3. Intense optical amplification in an Er: SizN4, waveguide amplifier. 
(A) Experimental setup for optical amplification with a free-space filter for 

pump suppression. ATT, tunable optical attenuator; PM, power meter; OSA, 
optical spectrum analyzer. (B) Measured gain spectrum. (€) Measured 


Liu et al., Science 376, 1309-1313 (2022) 17 June 2022 


On-Chip Pump Power (mW) 


(scatters) and simulated (curves) gain at 1550 nm. The color-shaded areas 
indicate the region of off-chip net gain and sources of loss. (D) Measured 
output power at 1550 nm. The inset shows the calibrated optical spectrum 
at 145 mW signal output. 


3 of 5 


RESEARCH | REPORT 


Resonator 


Microwave generation 


Amplified 
Microcomb 


WDM communication system 


(Coh. Rx) Rx 


B T T T T T Sb ina pani T Cc -40 T aT meer 7 oe 
-30F (i) 20 GHz microcomb input eee L-band | Input 20 GHz DKS 
Off-Chip Input:0.08 mw mM ATTA -50 Amplified by EDFA 
j IHN A 
— PT | | | |) j — Amplified by EDWA 
wi j il il | AAA -60 --- Estim. shot noise floor] 
| i I ---.- Estim. shot noise floor 
=e 70 - -@-- FSUP floor 
= fom, a 
N 
 -70 | z 
5 -10} (ii) Si,N, EDWA output aq 60 
2 Off-Chip Net Gain:16.6 dB (+ 3.5 dB) = 
LL I) oO 
oa I x2) -90 
8 " 2 
é signe tea g OT IM ND al tegen nten ——am——— 
a 
2 oe & -110 
& “50p - | 
5 -10} (ili) EDFA output —™ | &-120 
EDFA Net Gain:16.8 dB Ke E rah 
ij . 100 mm| -130 
a 
HH - 
Mi i | | | | _ 140 
| 150,00 eee 
i 1 2 2 
1530 1540 1550 1560 1570 1580 ° 4 6 ° ’ 
10 10 10 10 10 
Wavelength (nm) 
D 10 ——=""100. GHz microcomb input: - 9.0 dBm : 
& — Si,N, EDWA output: 15.6 dB 
Ss 0 
o 
5 
a -10 
@ 
aS) 
é -20 
a Hl : 
ia | 1 
5 | Hy H 
fad 
(e) 
191 192 193 It 192.2 THz 192.9 THz 193.9 THz 
Frequency (THz) BER =7.8 x10° BER = 1.2 x10° BER = 1.48 x10" 


Fig. 4. On-chip amplification of soliton microcombs. (A) Experimental setup 
for soliton amplification, microwave generation, and wavelength-division multi- 
plexing optical communication. FBG, fiber Bragg grating; DEMUX, demultiplexer; 
LO, local oscillator. (B) Amplification of 19.8 GHz microcomb. (i) Generated 
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is measured with ~3.5 dB insertion loss of fiber couplers and filters. (©) Measured 
single-sideband phase noises of generated microwave signals. (D) Optical 
spectra of amplified 100 GHz soliton microcomb and input. (E) Measured bit 
error ratio (BER). Blue (red) dashed line corresponds to 7% soft decision 

(20% hard decision) forward error correction overhead. (F) Reconstructed QPSK 
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microcomb with 100-GHz channel spacing 
in which a commercial wavelength-division 
multiplexer allows for individual access to 
the comb lines (Fig. 4, Aand D). We modu- 
late optical carriers with quadrature phase- 
shift keying (QPSK) bit stream at a symbol 
rate of 10 gigabaud and coherently detect 
the data transmitted over 1 km of standard 
single-mode fiber (27). More than 0.1 mW of 
optical power per line enables reliable data 
transmission for >20 channels (Fig. 4, E and 
F). We envision our EDWAs to act either as 
compact booster amplifiers after signal mod- 
ulation or as pre-amplifiers at receivers, meet- 
ing the requirements of the optical power 
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and SNR budget. The EDWA is also capable 
of amplifying mode-locked lasers with 1 pJ 
pulse energy and 100-MHz repetition rate 
(fig. S14). 

Our photonic integrated circuit-based EDWA 
can provide >145 mW on-chip output power 
and >30 dB of small-signal gain. The technique 
of ion implantation in Si,N, can provide a 
basis for the gain medium in a variety of inte- 
grated laser sources such as high-power soli- 
ton microcombs, low-noise rare-earth-ion-based 
CW lasers, femtosecond mode-locked lasers (27) 
or cavity soliton lasers (28). Moreover, it could 
allow for co-doping of other rare-earth ions 
such as ytterbium (emission at 1 um) and thu- 


17 June 2022 


lium (2 um), thereby providing optical gain in 
other wavelength regions. Further, this active 
Si3N, photonic platform is compatible with 
heterogeneous integration of thin-film lithium 
niobate, enabling the combination of both 
high-speed electro-optic modulation and am- 
plification on the same chip for coherent 
communications (29) or radio frequency pho- 
tonics (30). 
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TRANSCRIPTION 


Structural basis of nucleosome retention during 


transcription elongation 


Martin Filipovski, Jelly H. M. Soffers', Seychelle M. Vos”, Lucas Farnung** 


In eukaryotes, RNA polymerase (Pol) II transcribes chromatin and must move past nucleosomes, often 
resulting in nucleosome displacement. How Pol Il unwraps the DNA from nucleosomes to allow transcription 
and how DNA rewraps to retain nucleosomes has been unclear. Here, we report the 3.0-angstrom cryo-electron 
microscopy structure of a mammalian Pol II-DSIF-SPT6-PAF1c-TFIlS-nucleosome complex stalled 54 base 
pairs within the nucleosome. The structure provides a mechanistic basis for nucleosome retention during 
transcription elongation where upstream DNA emerging from the Pol Il cleft has rewrapped the proximal side of 
the nucleosome. The structure uncovers a direct role for Pol Il and transcription elongation factors in 
nucleosome retention and explains how nucleosomes are retained to prevent the disruption of chromatin 


structure across actively transcribed genes. 


ranscription requires the passage of RNA 

polymerase (Pol) II through chromatin. 

The nucleosome is a substantial barrier 

to Pol II because of an extensive network 

of contacts between histone proteins and 
the DNA that is wrapped around them (J). In 
vitro, Pol II is either unable to move past these 
obstacles or needs to displace at least one H2A- 
H2B dimer to transcribe through a nucleosome 
(2-6). By contrast, in vivo experiments have 
demonstrated that most transcribed genes 
have intact nucleosomes, and only very highly 
transcribed genes show nucleosomal loss (7, 8). 
Nucleosome maintenance during transcription 
is critical for cell survival, and disruption of 
chromatin structure during transcription elon- 
gation has a major impact on transcription 
fidelity; even partial loss of nucleosomes re- 
sults in aberrant transcription initiation events 
from nonpromoter regions within gene bodies 
(9, 10). Thus, nucleosomes play a fundamental 
role in suppressing cryptic transcription events 
and restricting transcription initiation to gene 
promoters. Moreover, nucleosome displacement 
leads to a loss of posttranslational histone 
modifications that have a central regulatory 
role in demarcating transcriptionally active 
gene loci (Z1). 

Understanding how nucleosomes are over- 
come and retained during transcription is a 
fundamental objective. Recently, cryo-electron 
microscopy (cryo-EM) structures of Pol II- 
nucleosome complexes have begun to eluci- 
date how transcription through nucleosomes 
is mediated (12-15). They have not, however, 
addressed the question of how nucleosomes 
are retained during transcription. Biochemical 
and in vivo data suggest that the formation of 
an intranucleosomal Pol II containing a tem- 
plate loop (the “O-loop”) could play an impor- 
tant role in nucleosome retention (6, 16-19). 
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To investigate how Pol II traverses the nu- 
cleosome and concurrently maintains nucleo- 
some integrity, we designed a nucleosomal 
substrate that allows for transcription of 
Pol II close to the nucleosomal dyad, a posi- 
tion previously described as a major stall site 
for Pol II (20). The nucleosomal DNA tem- 
plate contains a modified Widom 601 sequence 
that lacks deoxythymidine in the template 
strand from the transcription start site to a 
position 64 base pairs (bp) within the nucleo- 
some (+64) (fig. S1A). An RNA primer can be 
annealed to a 3'-DNA overhang at the entry 
site, forming a nine-nucleotide DNA-RNA hy- 
brid to allow for primer extension by Pol II 
upon the addition of nucleoside triphosphates. 
Mononucleosomes were assembled on this 
DNA template and then the following were 
added: (i) purified Sus scrofa Pol II and Homo 
sapiens elongation factor PAFic composed of 
subunits CTR9, CDC73, WDR61, PAF1, LEO1, 
and RTF1; (ii) DRB-sensitivity-inducing factor 
(DSIF) composed of subunits SPT4 and SPT5; 
and (iii) SPT6, TFIIS, the kinase-positive 
transcription elongation factor b (P-TEFb), 
and 3’-deoxy-adenosine 5’-triphosphate (fig. 
SIB). The transcription reaction was initiated 
by the addition of cytosine 5’-triphosphate, 
guanasine 5’-triphosphate, and uridine 5'- 
triphosphate (fig. SIC). 3'-Deoxy-adenosine 
5'-trisphosphate enabled robust stalling at 
the engineered stall site while serving as a 
P-TEFb substrate to allow for phosphorylation- 
dependent complex formation (fig. S1, C and D) 
(21). Subsequently, the transcription reac- 
tion mixture was subjected to size-exclusion 
chromatography, which revealed the formation 
of a Pol II-DSIF-PAFic-SPT6-TFIIS-nucleosome 
complex (Fig. 1A and fig. S1, D to F). Analysis 
of the transcription products by denaturing 
gel electrophoresis revealed that RNA frag- 
ments corresponding to nucleosomal stall 
sites +38, +54, and +64 were the predomi- 
nant products (Fig. 1B) (20). Peak fractions 
containing the nucleosome-bound elongation 


17 June 2022 


complex were mildly cross-linked with glu- 
taraldehyde and prepared for single-particle 
cryo-EM. 

The structure of the activated elongation 
complex bound to a nucleosome was deter- 
mined at an overall resolution of 3.0 A from 
105,420 particles (Fig. 1C, figs. S2 to $4, movie 
S1, and table S1 and S3). A final composite map 
was assembled from masked refinements of 
the nucleosome, Pol II, and the PAF1 complex, 
with local resolutions ranging from 2.9 to 
11.1 A (figs. S2 to $4). Known structures of the 
complete mammalian activated elongation 
complex, TFIIS, and the nucleosome were 
placed into this map, and the structures were 
adjusted locally (22). The reconstruction shows 
DNA density extending from the upstream 
DNA toward the nucleosome. We observed a 
change in the location of the downstream 
DNA in the Pol II cleft compared with the 
DNA location of previously obtained elonga- 
tion complexes (13, 15, 21, 22) (Figs. 1, C and D, 
and 2, and fig. S4F). We de novo built the DNA 
to accommodate the observed changes in the 
DNA path. The model was real-space refined 
and shows good stereochemistry (table S3). 

High-resolution features of the DNA-RNA 
hybrid and visualization of the complete tran- 
scription bubble allowed unambiguous assign- 
ment of the nucleic acid register (fig. S4, C, G, 
and H). In our structure, Pol II has unraveled 
DNA up to nucleosomal superhelical location 
(SHL) -0.5, and the Pol II active site is located 
at nucleosomal bp +54 (Fig. 2A). It adopts a 
mixture of pretranslocated states (nucleotide- 
binding site occupied by newly extended RNA 
3’ end) and posttranslocated states (nucleotide- 
binding site not occupied) (fig. S4C). The +54 
site that we observed was selected by the 
transcribing Pol II because the engineered 
stall site is only at nucleosomal bp +64. This 
implies that stalling at +54 is a major barrier 
to Pol II. 

The transcribed upstream DNA adopts a 
new path when exiting the Pol II cleft. The 
upstream DNA projects away from Pol II 
toward the nucleosome, and ~55 bp of tran- 
scribed DNA rewraps the proximal side of the 
nucleosome from SHL -6.5 to SHL -1 (Figs. 1D 
and 2, A and B, and figs. S5 and S6). The path 
of the rewrapped DNA follows the DNA con- 
formation of a canonical nucleosome. The up- 
stream DNA stabilizes the proximal H2A-H2B 
dimer that plays a critical role in nucleosome 
retention during transcription (Fig. 2, A and 
B) (2). The DNA rewrapping also has conse- 
quences for the path of the downstream DNA. 
Usually, the downstream DNA duplex and the 
DNA-RNA hybrid emanate at a 105° angle 
from the Pol II active site, where the down- 
stream DNA lies in the cleft between the 
clamp and RPB2 (fig. S5) (22). Farther down- 
stream, the DNA contacts the Pol II jaw do- 
main of RPB5. In our structure, the downstream 
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by a 90° rotation. Pol Il, SPT5, SPT6, CTR9, WDR61, PAF1, LEO1, CDC73, RTF1, TFIIS, H2A, H2B, H3, and H4 are shown in gray, forest green, dark blue, orange, teal, 
limon, purple, dark red, pink, yellow, light yellow, red, light blue, and chartreuse, respectively. The DNA template is colored upstream to downstream as a gradient 
from white to blue, respectively. The same colors are used throughout. (D) Schematic of the complex. Unresolved downstream DNA is shown as a dotted line. 


Fig. 2. Details of nucleosome rewrapping 
during transcription. (A) Rewrapping 

of transcribed upstream DNA from 

SHL -6.5 to SHL -1 on the histone octamer 
and unwrapping of downstream DNA 

from SHL +5 to SHL +7. H3 inserts 
between rewrapped upstream and 
downstream DNA. (B) Nucleosome side 
view showing template looping with 

the transcribed upstream DNA 

rewrapping the nucleosome. Directions 

of transcription and rewrapping 

are indicated. 


DNA duplex is bent away from the jaw domain 
into the cleft by ~30°, losing contact with 
RPB5 (fig. S5B) (22). After exiting the Pol II 
cleft, the downstream DNA is then contorted 
by ~120° and is bound by the histones from 
SHL -0.5 to SHL +5 (Fig. 2, A and B, and fig. 
S5C). This DNA path differs from previous 
downstream DNA conformations of Pol II- 
nucleosome complexes in which the down- 
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stream DNA projects in a straight B-DNA 
conformation toward the nucleosome (12-15). The 
small gap between downstream DNA at SHL -0.5 
and rewrapped nucleosomal DNA at SHL -1 is 
filled by histone H3 N-terminal residues 39 
to 43 (Fig. 2A). Additionally, we observed that 
downstream DNA starting at SHL +5 to SHL 
+7 was no longer bound by the nucleosome 


and projects away from the complex (fig. S5D). 


17 June 2022 


On the upstream side of Pol II, the rewrap- 
ped DNA displaces SPT4 and the SPT5 NGN 
and KOWI1 domains of DSIF from their pre- 
viously observed binding positions on Pol II 
(Fig. 3A) (21, 23). However, we observed den- 
sities for the SPT5 domains KOW2-3, KOWx-4, 
and KOWS5 (fig. S4, K and L). This suggests 
that the partial displacement of SPT4 and SPT5 
from the tip of the Pol II clamp and from the 
RPB2 protrusion domain is required for re- 
wrapping of nucleosomal DNA and supports 
nucleosome retention, as reported biochem- 
ically (24). Consistent with this observation, 
functional genomics data show changes in 
cross-linking patterns for Spt4/5 as Pol II 
transcribes through a nucleosome (25). We 
observed a second Pol II-DSIF-PAF1c-SPT6- 
TFIIS-nucleosome complex in our data stalled 
at nucleosomal bp +38 (figs. S7 to S9 and 
tables S2 and S83). This complex shows no 
rewrapping, but SPT4 and the SPT5 NGN and 
KOW1 domains are present and are positioned 
at the Pol II clamp as observed in previous 
studies (fig. S9, F and G). 

In the rewrapped nucleosome, we observed 
interactions among Pol II, the PAFIc subunit 
CTR9, and the nucleosome (Fig. 3, B to D, 
and figs. S4, I to L). These contacts actively 
stabilized the observed nucleosome confor- 
mation by contacting the rewrapped DNA. 
First, we observed an interaction between 
the RPB2 protrusion residue lysine 427 (K427) 
and the upstream DNA (Fig. 3B). RPB2 K427 
is widely conserved among eukaryotes but is 
not found in the corresponding Escherichia 
coli RNA polymerase subunit (fig. S10). Thus, 
the protrusion domain may have evolved to 
support chromatin transcription by mediat- 
ing DNA rewrapping. Second, like previous 
Saccharomyces cerevisiae Pol II-nucleosome 
structures (13, 15), RPB1 clamp head residue 
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K203 contacts downstream nucleosomal DNA 
at SHL -0.5 (Fig. 3C). Third, we observed that 
the C-terminal end of the CTR9 trestle helix 
lies near the rewrapped DNA at SHL -1, po- 
tentially stabilizing the rewrapped DNA (Fig. 
3D). These contacts formed by Pol II and 
transcription elongation factors with the re- 


Fig. 3. Interactions and changes in elongation 
complex conformation. (A) Comparison of the 
Pol II-DSIF-SPT6-PAF1c complex on the linear 
DNA template (gray, PDB 6TED) with the 

Pol II-DSIF-SPT6-PAF1c-TFIIS-nucleosome 
structure stalled at +54 revealing displacement 

of SPT4 and SPT5 NGN and KOW1 domains 

in the rewrapped complex. The trajectory of 

the upstream DNA duplex is altered compared with 
the Pol II-DSIF-SPT6-PAFlc complex on a linear 
DNA template. SPT5 KOW2-3, x-4, and 5 

(forest green) are retained in the rewrapped 
complex. (B) Interaction of Pol II RPB2 protrusion 
domain residue K427 with nucleosomal DNA 

at the dyad. (€) Interaction of the Pol II clamp 
head (residue K203) with nucleosomal DNA. 

(D) The CTR9 C terminus is positioned near 

the rewrapped nucleosomal DNA at SHL -1. 
Density is shown in gray. 
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Fig. 4. Model of nucleosome retention. Model depicting Pol I! transcription 
through a nucleosome. After Pol II has transcribed sufficiently into the 
nucleosome, the DNA rewraps. Further transcription may result in a complete 
transfer of the nucleosome to upstream DNA. The transfer may result in a 
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nucleosome retention by stabilizing the nu- 
cleosomal DNA through multiple contacts with 
the DNA phosphate backbone. 
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tion by facilitating DNA rewrapping back onto 
the histone octamer surface (Fig. 4). This re- 
wrapping and template looping supports the 
0-loop hypothesis that was proposed on the 
basis of biochemical data (6, 16). This model 
proposes that Pol II transcribes nucleosomes 
while DNA both upstream and downstream of 
Pol II is bound to the nucleosome. Our struc- 
ture supports a loop size of ~90 bp (fig. S6), 
comparable to atomic force microscopy mea- 
surements (6). Unambiguous assignment of 
the DNA register and comparison with canon- 
ical nucleosomes showed that 30 bp of pre- 
viously linear entry side DNA rewrap the 
nucleosome from SHL -4 to SHL -1 (fig. S6), 
resulting in an upstream nucleosomal shift 
after transcription (6, 26). Such a retrograde 
movement of nucleosomes has been observed 
both in vivo and in vitro (19, 26). 

Previous structural studies have shown that 
stabilization of the proximal H2A-H2B dimer 
can be achieved by the binding of a secondary, 
“foreign” DNA molecule in trans (14, 15). By 
contrast, our structure, in which Pol II is ac- 
companied by the transcription factors DSIF, 
SPT6, and PAFic, now reveals that rewrapping 
of upstream DNA can also occur in cis, where 
the same DNA molecule rewraps the histone 
core (Fig. 4). This has wide implications for 
our understanding of the mechanism of 
nucleosome retention during transcription. 


Nucleosome transfer 
to upstream DNA 


RNA Pol Il RNA 


Upstream 
ay DNA 


Downstream 
DNA 


positional shift of the nucleosome. Pol II is shown as an outline. Metal 
A is shown as a magenta sphere. The histone octamer is shown as a 
gray circle. Nucleosomal entry bp +1 is indicated with a black arrow. 
The dyad axis is indicated as a dotted line. 
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Whereas binding of a secondary molecule in 
trans suggests a mechanism involving stabi- 
lization of the nucleosome to distal DNA, our 
structure provides evidence that nucleosomes 
are not fully disassembled during transcrip- 
tion (6, 17-19) (see the supplementary text). 

We propose a model that explains nucleo- 
some transfer to the upstream side of Pol II 
(Fig. 4). Transcription of Pol II past the dyad 
is likely to result in complete unraveling of 
the ~50 bp of downstream DNA that are 
still wrapped around the nucleosome from 
SHL -0.5 to SHL +5. Release of the down- 
stream DNA from the octamer will result 
in a complete transfer of the nucleosome to 
the upstream DNA. At this point, Pol II only 
transcribes linear downstream DNA, which 
may explain why the distal part of the nu- 
cleosome does not pose any major transcrip- 
tional barrier (12, 20). The proximal and 
distal sides of the histone octamer switch 
during the transfer process. 

Further rewrapping of upstream DNA may 
result in a reduced linker length between the 
transcribed nucleosome and the preceding nu- 
cleosome. Chromatin remodelers belonging to 
the CHD and ISWI families might be needed 
to reestablish nucleosome spacing (27). The 
chromatin remodeler CHD1, for example, could 
be readily recruited by the upstream-positioned 
PAFic subunit RTF1 to engage the retained 
nucleosome and correct nucleosome spacing (28). 

Nucleosomes are critical for transcription 
fidelity by limiting transcription initiation to 
promoter regions and by maintaining tran- 
scriptional memory through epigenetic mod- 
ifications. Our structure provides a structural 
foundation to explain how nucleosomes are 
retained during transcription elongation, and 
it also provides a framework to understand 
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the actions of the transcription machinery in 
a chromatin environment. 
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Droughts and societal change: The environmental 
context for the emergence of Islam in late 


Antique Arabia 


Dominik Fleitmann'*, John Haldon2*, Raymond S. Bradley*, Stephen J. Burns®, Hai Cheng*>*, 
R. Lawrence Edwards’, Christoph C. Raible®®, Matthew Jacobson”°, Albert Matter” 


In Arabia, the first half of the sixth century CE was marked by the demise of Himyar, the dominant power 
in Arabia until 525 CE. Important social and political changes followed, which promoted the 
disintegration of the major Arabian polities. Here, we present hydroclimate records from around 
Southern Arabia, including a new high-resolution stalagmite record from northern Oman. These records 
clearly indicate unprecedented droughts during the sixth century CE, with the most severe aridity 
persisting between ~500 and 530 CE. We suggest that such droughts undermined the resilience of 
Himyar and thereby contributed to the societal changes from which Islam emerged. 


he Himyarite kingdom was the domi- 

nant power in Arabia from the late third 

to the early sixth century CE (Fig. 1A). Its 

dissolution over the course of the sixth 

century CE marked the end of a 1400- 
year-long pre-Islamic period of supratribal 
polities in Arabia (1-6), but the reasons for 
its demise remain debated. Himyar’s decline 
was characterized by political disorder, socio- 
economic change, shifting settlement patterns 
and demography (Fig. 1, B and C), and the de- 
cline of major irrigation systems (2, 4, 5, 7, 8). 
The annexation of Himyar by Aksum (today’s 
Ethiopia) in 525 CE and frequent interven- 
tions from the Byzantine and Sasanian Em- 
pires were additional factors that contributed 
to profound social, economic, and political 
transformations during the sixth century 
CE in Arabia (J, 5, 6). Explanations for these 
changes focus on sociopolitical factors (4, 5). 
The possibility that drought played a role has 
been largely ignored (4, 6, 9, 10), despite the 
fact that Himyar and the southern Arabian 
region more widely were extremely vulnerable 
to droughts because their economy was based 
on rain-fed and irrigated agriculture (5, 6, 9). 
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The lack of highly resolved and precisely dated 
palaeoclimate records from southern Arabia 
has prevented an assessment of whether 
droughts were a factor that contributed to 
Himyar’s decline. Here, we combine hydro- 
logical, historical, and archaeological records 
from the Middle East and East Africa with a 
new stalagmite record of winter-spring pre- 
cipitation and effective moisture (EM) in 
northern Oman to demonstrate that severe 
and persistent droughts may have been an 
important factor in the changes that took 
place in southern Arabia in the sixth century 
CE (2, 4, 5, 1). 

Stalagmite H12 from Hoti Cave in northern 
Oman (57°21'E, 23°05'N; 800 m above sea 
level) (Fig. 1A and figs. S1 and S2) provides a 
well-dated hydroclimatic record that spans 
this critical period of history. Rainfall in north- 
ern Oman and most of Arabia originates 
predominantly from Mediterranean frontal 


systems, whereas summer monsoon (SM) pre- 
cipitation affects only the Yemeni highlands 
and southern Oman (72). Thunderstorms during 
summer and occasional cyclones are addition- 
al sources of rainfall. Regional precipitation 
varies between 50 and 255 mm year (Al 
Hamra; 1978-1997) and occurs in winter and 
spring (~65% of annual rainfall) and summer 
(fig. S3). Importantly for this study, changes in 
rainfall (fig. S4) are consistent across south- 
ern Arabia, particularly for extreme droughts, 
which are often supraregional (13). 

The chronology of H12 is based on 20 U-Th 
ages, with an average growth rate of ~0.23 mm 
year ' over the past 2650 years (fig. S5 and 
tables S1 and $2). The oxygen (8!80) and car- 
bon (8°C) isotope records have a ~2-year res- 
olution and reflect fluctuations in rainfall 
amount and EM, respectively (74), as indicated 
by several observations. First, H12 8'°O values 
correlate [correlation coefficient (7) = -0.69; 
p < 0.001] with mean annual rainfall in Oman 
and are consistent with rainfall anomalies for 
the entire Arabian Peninsula (15, 16), with 
higher rainfall related to more negative 5'°O 
values (Fig. 2, A and B). Second, more positive 
H12 8'°O values correspond to a reduced di- 
ameter of the stalagmite cap (Fig. 2C), indic- 
ative of lower drip rates and winter-spring 
precipitation, respectively (17). Third, the strong 
correlation between H12 8'O and 8”°C values 
(7° = 0.52) indicates that calcite did not pre- 
cipitate in isotopic equilibrium with its parent 
drip water. Because 5’°C values are strongly 
governed by CO, degassing, more positive 5”C 
values indicate lower drip rates and a lower 
partial pressure of CO, (Pcoz) of cave air be- 
cause of enhanced ventilation of Hoti Cave at 
times of lower precipitation and cave lake levels 
(fig. S1). Additionally, reduced vegetation den- 
sity and soil microbial activity during drier 
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Fig. 1. Location and settlement maps. (A) Map showing the location of key proxy records and the 
suggested extent of Himyar (red line) and Kinda (green line) at the beginning of the sixth century CE (6). 
Black dashed lines denote the main trading routes (6). (B and C) Settlement patterns in Himyar in the 


(B) fourth and (C) sixth centuries CE (8). 
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Fig. 2. Hoti Cave and meteorological records. (A and B) Comparison of H12 
3'°0 values and (A) mean annual rainfall for Oman (15) and (B) precipitation 
anomalies [with respect to the 1978-1997 average of -1.8 per mil (%o)] averaged for 
the Arabian Peninsula (16). VPDB, Vienna Pee Dee Belemnite international standard. 


years leads to lower biogenic soil Pco, and 
more positive calcite 5°C values (78). Thus, 
H12 880 and 8°C values are primarily gov- 
erned by precipitation and EM. 

The H12 80 profile displays high variabil- 
ity in EM over the past ~2600 years, with two 
periods of persistently lower EM from ~250 BCE 
to 25 CE and from ~480 to 1400 CE, on which 
distinct decadal to multidecadal fluctuations 
in 8'8O were superimposed (Fig. 2C). The most 
pronounced minimum in EM occurred from 
~520 to 532 CE, with the most severe drought 
conditions of the entire H12 record centered 
at ~523 CE + 30 years (Fig. 3B). This date is 
in excellent agreement with other indepen- 
dently dated winter-spring rainfall records 
from the Middle East (Fig. 3). In the Neor Lake 
record from northern Iran (19), a sharp rise in 
titanium (high dust flux) starts at the begin- 
ning of the sixth century CE. The lake level of 
the Dead Sea (20) declined from ~490 CE on- 
ward, and more positive 5"°C values (reduced 
EM) are observed in the Jeita Cave &C record 
from Lebanon at ~510 CE (21). Likewise, the 
frequency of historical accounts of droughts 
in the Middle East peaks between 520 and 
530 CE (Fig. 3F), with the spring of Siloam in 
Jerusalem reportedly drying up at ~520 CE 
(22, 23). Overall, key hydrological records from 
the Middle East indicate drought conditions 
related to a decline in winter-spring precipi- 
tation and a substantial decrease in EM from 
~480 CE onward. By contrast, the Nar Lake 
8'°0 record from Turkey (24) documents a 
substantial increase in precipitation, which 
is most likely related to a northward displace- 
ment of storm tracks in the eastern Mediter- 
ranean at the beginning of the sixth century 
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CE, with fewer rain-bearing storms reaching 
the Fertile Crescent and Arabia. 

Though winter-spring precipitation is the 
main source of annual rainfall throughout the 
Middle East, SM precipitation is an additional 
crucial source of rainfall in southern Arabia 
and Yemen (Himyar) in particular. SM records 
from southern Oman (Qunf Cave) (12, 14), 
Ethiopia (Lake Ashange and Lake Abhé) 
(25, 26), the Arabian Sea (sites RC2735 and 
723) (27, 28), India (Sahiya Cave) (29), and 
China (Dongge Cave) (30) indicate that SM 
wind strength and precipitation reached their 
absolute minimum during the sixth century 
CE as boreal summer insolation, a primary 
driver of monsoon intensity (14), reached 
its lowest Holocene values (Fig. 4). Further- 
more, an abrupt decline in precipitation be- 
tween 450 and 500 CE is evident in the SM 
records from Ethiopia (Lake Ashange) (25), 
the Arabian Sea (sites RC735 and 723) (27, 28), 
and India (core 16A) (37), and, taking age 
uncertainties of the individual records into 
account, these are broadly concurrent with 
decreasing winter-spring precipitation in 
the H12 880 and other hydroclimate records 
displayed in Fig. 4. In addition, the precisely 
dated Sahiya Cave (India) SM record (29) re- 
veals a decrease in SM rainfall after ~490 CE 
(Fig. 3H). Taken together, mean annual pre- 
cipitation in Arabia and northeastern Africa 
reached an absolute minimum at the begin- 
ning of the sixth century CE, with the most 
severe drought conditions persisting between 
~500 and 530 CE (Fig. 4). 

The synchronicity between peak aridity in 
southern Arabia and the sudden decline of 
Himyar points to a possible causal relationship 
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(C) H12 380 (black), 8°C (green), and stalagmite diameter (red line) records. The 
red-shaded area shows EM anomalies with respect to the 1978-1997 mean (-1.8%o). 
Black triangles denote *°°Th dates (tables S1 and $2). Black-shaded bars denote 
the estimated lifetime of kingdoms in southern and central Arabia (11). 


between both. Himyar, with its centralized 
political system of kingship and subordinate 
chiefdoms, was the dominant power in south- 
ern and central Arabia (Fig. 1) (2, 6). Though 
international trade in aromatics and metals 
had earlier also been a major source of rev- 
enue, agricultural production was fundamen- 
tal to both central and regional economies 
(5, 7, 32). The importance of agriculture is 
evidenced by widespread terraced fields in the 
highlands, numerous irrigation systems along 
the desert margin, and a considerable expan- 
sion of hydraulic structures between the first 
and fourth centuries CE (5, 33-35) (figs. S6 
and S7). The dams, for instance, were a sym- 
bol and instrument through which Himyarite 
rulers exerted their authority, as indicated by 
inscriptions. Dams and terraced fields were 
designed to harvest rainfall and runoff (“run- 
off irrigation”) because water was the most 
limited agricultural resource. The structural 
integrity of the terraced fields depended on 
the proper maintenance of the whole system 
of hillside terraces, and dams required con- 
stant repairs, which only a well-organized 
workforce and stable society could provide 
(2, 6, 7). Furthermore, a stable society and a 
functioning political entity were crucial to 
manage the proper allocation of water for 
the irrigated plots, particularly during drier 
years with more frequent water disputes (36). 
Himyar’s agricultural productivity depended 
on a consistent rainfall cycle, with two rainy 
seasons in spring and summer (fig. S8). These 
factors combined made Himyar vulnerable to 
droughts, and low agricultural yields would 
have had a negative impact on Himyar’s socio- 
economic and political stability and thus the 
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proper management of all water-harvesting 
structures. Thus, the abrupt and persistent 
decrease in rainfall and EM, respectively, 
throughout Arabia at the end of the fifth and 
beginning of the sixth century CE must 
have been a substantial stressor, undermin- 
ing Himyar’s resilience to both internal and 
external socioeconomic and geopolitical con- 
flicts, particularly Christian Aksumite attempts 
to control the lucrative trading routes along 
the Red Sea and Arabian Sea coasts. Such 
stress may well have been further exacerbated 
by the possible effects of the global cooling 
that marked the beginning of the so-called 
Late Antique Little Ice Age that is visible in 
proxy data from the later 530s CE (37). 
From the 480s CE, Himyar faced a number 
of problems. Internal conflict between rival 
elite lineages, partly informed by religious af- 
filiations to either Judaism or Christianity, 
was exacerbated by Aksumite intervention 
between 518 and 530 CE that reduced Himyar 
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to vassal status (5, 11, 38-40). This was fol- 
lowed by a brief revival of Himyarite power 
(but under an Aksumite general, Abreha) (5), 
to which a short-lived increase in winter- 
spring precipitation between 530 and 545 CE 
(Figs. 2 and 3) may perhaps have contributed. 
Yet it took this new regime at least four ex- 
peditions over more than 15 years to reestab- 
lish Himyarite political hegemony outside Yemen 
and Hadramaut, suggestive of both the weak- 
ening of its power and the extent of fragmen- 
tation of the traditional order in the region 
(38). Further indications of this include evi- 
dence for drought-induced migration across the 
region into Persian- and Byzantine-controlled 
Syria and Iraq, as well as southward into 
Yemen and Hadramaut, and the failure to 
maintain the dam of Marib, leading to a col- 
lapse of the system in ~580 CE and its final 
destruction in ~600 CE (an event important 
enough to find mention in the Koran 34:15-17) 
(2, 5, 7). The disappearance of Himyarite in- 
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scriptions after ~559 CE (5) may also indicate a 
breakdown of political authority and the estab- 
lished order. From about 570 CE, Himyar came 
under Persian domination and effectively ceased 
to exist as an independent polity (1-3, 5, 6, 40). 

The weakening of Himyar from the 480s CE 
onward, and its rapid conquest by Aksum in 
525 CE, were thus directly coincident with 
both internal conflict as well as increasing 
aridity and declining agricultural yields that 
were due to a simultaneous reduction in 
winter-spring and SM precipitation, culminat- 
ing in the most severe drought conditions of 
the entire Holocene in the early sixth century 
CE. Himyar’s continued weakening through 
the middle decades of the sixth century CE, 
in spite of the temporary resurgence under 
Abreha, and its final dissolution in the 570s CE 
should thus be understood in the context of 
these conditions and the evidence for the per- 
sistence of drought conditions during the sixth 
century CE (Fig. 4). Aksum, however, was 
less affected by the drought because rainfall 
amounts were generally much higher (fig. S9) 
and a reduction in rainfall was therefore 
less detrimental to agricultural productivity. 
Although correlation is not necessarily causa- 
tion, the singular magnitude and persistence 
of sixth-century drought, superimposed on a 
region that was highly dependent on rain-fed 
agriculture, is coincident with a clear turning 
point in Arabian history (2, 4-6), when a cas- 
cade of regional political and socioreligious 
transformations occurred over the following 
decades. 

Whether Himyar’s decline as the dominant 
Arabian polity induced a sociopolitical vacuum 
remains unclear, although this is often assumed 
(4-7). But it has been plausibly argued that 
one result of the changed situation throughout 
Arabia during the sixth and early seventh 
centuries CE was a growth in the importance 
of pilgrimage and economic centers. Of these, 
Mecca (Muhammad’s birthplace) gained con- 
siderable influence in central and southern 
Arabia in the first half of the sixth century CE 
(41). Furthermore, the reduction in the reach 
of Himyarite power drew the competing 
Byzantine and Sasanian empires into increas- 
ing and intensified competition for political 
and religious influence in the region as well 
as for economic resources, including precious 
metals (42, 43). Such involvement was long- 
standing, and their influence was exercised 
both through their client kingdoms, the 
Ghassanid and Lakhmid confederacies (and 
their earlier equivalents) in northern Arabia, 
and through Aksum. Direct intervention was 
thus not new and had indeed occurred on 
several occasions from the third century on 
(44-46). But the crucial difference from the 
middle of the sixth century was precisely the 
absence of a stable Himyarite kingdom. Finally, 
external interventions stimulated a hostile 
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reaction, both to Christian Aksumite proselyt- 
izing and to the political maneuvering of the 
neighboring empires (11), although that this 
promoted a search for a new “Arab” identity 
remains questionable (2, 10, 41). Regardless, 
after 525 CE and the Aksumite invasion, both 
Arabian reactions and outside interventions 
occurred in a geopolitical context without an 
effective centralizing power. The decline and 
dissolution of Himyar was thus a critical element 
in the socioeconomic, political, and religious- 
cultural transformations in Arabia that ulti- 
mately framed the emergence of a range of 
new religious leaders or movements, many 
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focused around monotheistic beliefs, includ- 
ing Islam. The challenges faced by the Islamic 
leadership in unifying the Arabian Peninsula 
in the 620s CE, in spite of substantial oppo- 
sition from such competitors as well as from 
tribes that had converted to Judaism or were 
hostile to the Quraysh (Muhammad’s tribe), 
reflected this situation (2-4, 6, 47). But by the 
late 620s CE, with Himyarite authority a dis- 
tant memory and both Persia and Byzantium 
exhausted by their long war, the early Islamic 
leadership had the field to itself with respect 
to its ability to mount long-distance military 
expeditions. 
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We do not suggest that the extreme droughts 
of the late fifth to early seventh century CE in 
Arabia were the direct trigger for these fun- 
damental changes (43), which were complex 
and, in several respects, of long duration. But 
Wwe can now confirm from an entirely inde- 
pendent proxy that the Arabian Peninsula 
suffered from unusual and extreme aridity in 
the period between ~500 and 530 CE. This 
constituted an important, and hitherto largely 
neglected, factor that can only have under- 
mined Himyarite resilience to both longer-term 
and more immediate internal and external 
stresses (2, 4). 
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A chemoenzymatic strategy for site-selective 
functionalization of native peptides and proteins 
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The emergence of new therapeutic modalities requires complementary tools for their efficient syntheses. 
Availability of methodologies for site-selective modification of biomolecules remains a long-standing 
challenge, given the inherent complexity and the presence of repeating residues that bear functional 
groups with similar reactivity profiles. We describe a bioconjugation strategy for modification of native 
peptides relying on high site selectivity conveyed by enzymes. We engineered penicillin G acylases 

to distinguish among free amino moieties of insulin (two at amino termini and an internal lysine) and 
manipulate cleavable phenylacetamide groups in a programmable manner to form protected insulin 
derivatives. This enables selective and specific chemical ligation to synthesize homogeneous 
bioconjugates, improving yield and purity compared to the existing methods, and generally opens 
avenues in the functionalization of native proteins to access biological probes or drugs. 


ith the increasing number of new 

modalities and biologics in pharma- 

ceutical pipelines (7-4), the selective 

transformation of biomolecules, such 

as proteins and peptides, has emerged 
as an important challenge in organic synthe- 
sis. The need to maintain the complex tertiary 
structure of biological polymers, combined 
with the diversity and repetition of functional 
groups, demands both mild conditions and 
high chemoselectivity, each of which is diffi- 
cult to achieve with existing methods (5-7). 
Current strategies to directly modify native 
proteins rely on the inherent reactivity of func- 
tional groups within the molecule, either by 
targeting modifications at the termini (8) or 
by exploiting subtle differences in the nucleo- 
philicity, pK, (9), or redox potential (10, 11) of 
the repeating amino acid residues, which nec- 
essarily restricts the scope and selectivity of 
such methodologies (5-7) (Fig. 1A). 
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Consequently, genetic modifications of the 
proteins where an (un)natural amino acid or 
recognition motif is inserted as a handle for 
site-selective functionalization have gained 
great interest (5-7). However, the introduction 
of recognition tags invariably alters the bio- 
molecule, which may affect the very function 
that it performs as well as narrowing the scope 
of such approaches. These limitations have 
created a high demand for the invention of 
scalable strategies for site-selective bioconju- 
gation of native proteins to accelerate the dis- 
covery and development of biological probes 
and therapeutics (4, 12-16). Here we show that 
strategies based on enzyme catalysis can en- 
able improved bioconjugation of native pro- 
teins such as insulin (1, Fig. 1, A and B). 

Over the past several decades, the growing 
epidemic of diabetes mellitus has motivated 
many research laboratories to search for ther- 
apies that exhibit improved safety and phar- 
macology (17). The inherently low therapeutic 
index of insulin-based medicines, where over- 
dosing can lead to hypoglycemia, remains a 
substantial medical risk and necessitates com- 
plex dosing regimens (17). Modified insulin 
molecules will advance understanding of 
how the insulin receptor is activated and the 
structure-activity relationships of its binders 
(18) and may ultimately lead to safer insulin- 
based medicines for glycemic controls (17). 
Our laboratories have been pursuing diverse 
human insulin bioconjugates with promising 
therapeutic potential and improved pharma- 
codynamic profiles, including insulin receptor 
partial agonists MK-1092 and MK-5160 (19) 
or glucose-responsive insulin MK-2640 (20) 
(Fig. 1C, 5a to 5e). 
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These medicinal chemistry efforts targeted 
the modification of free amino groups on the 
native peptide. In the case of recombinant 
human insulin 1, this approach required dis- 
criminating among three distinct amine func- 
tionalities: the N-terminal glycine Al (pK, 8.4), 
the N-terminal phenylalanine B1 (pK, 7.1), and 
the side-chain e-amino group of lysine B29 
(pK, 11.1) [Fig. 14; pK, values based on (27)]. 
Our initial conjugation strategy to access in- 
sulin analogs was driven primarily by differ- 
ences in pK,, which afforded only modest 
site selectivity (79). As a result, a challenging 
chromatographic purification was required 
to separate structurally related compounds, 
leading to moderate yields of the bioconju- 
gates. The complexity of implementing pK,- 
based modification of human insulin drove 
the need for a general, scalable ligation strat- 
egy to obtain homogeneous conjugates and 
facilitate further discovery and development 
efforts. 

We envisaged an approach similar to that 
of protecting group strategies used in small- 
molecule chemistry, where selectively pro- 
tected insulins 3 (Fig. 1B) could enable their 
site-selective conjugation with any molecule 
of interest (e.g., sugar, polyethylene glycol, 
peptide), expediting the synthesis of a diverse 
array of modified insulins 5 (Fig. 1C) (79). To 
translate this vision into practice, we planned 
to exploit the specificity displayed by en- 
zymes to prepare derivatives 3. Enzymes are 
particularly suited to catalyze site-specific 
modifications that preserve the structure 
of biomolecules, as they can act under mild 
aqueous reaction conditions to confer high 
chemo- and regioselectivity (13-16). One key 
advantage of these methods is the ability to 
engineer properties of enzymes through di- 
rected evolution to enhance catalytic efficiency 
and to achieve exquisite selectivity toward the 
structural motif of choice (22, 23). As a result, 
the growing number of enzymatic bioconjuga- 
tion strategies provides a powerful extension 
to the scope of existing chemical approaches 
(5). Diverse hydrolases have been applied to 
site-selective protein modification to access 
antibody-drug conjugates, vaccines, and linear 
and macrocyclic peptides (13-16). We were 
encouraged by early reports on the depro- 
tection of N-phenylacetyl-protected peptides 
using penicillin G acylase from Escherichia 
coli (EcPGA, E. C. 3.5.1.11) (24, 25). This en- 
zyme displayed selective activity toward phe- 
nylacetyl amides, leaving peptide bonds and 
C-terminal esters intact. Wider adoption of 
PGA-catalyzed functionalization of proteins 
was hampered by the enzyme’s low activity 
and its inhibition by both substrates and 
products (25, 26). At the outset, we planned 
to explore natural PGAs for their ability to 
discriminate among the free amino groups 
in insulin (Fig. 2) to prepare orthogonally 
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protected (phenylacetyl)insulins 3a to 3f. 
If successful, such a “toolbox” would enable 
well-defined chemical ligation to access the 
full suite of insulin bioconjugates 5 (Fig. 1C). 

We began by testing a library of >100 com- 
mercially available PGA homologs, screening 
for the hydrolysis of tri(phenylacetyl)insulin 
2a, chemically synthesized from insulin 1 with 
activated phenylacetyl derivative 6 (Fig. 2, see 
supplementary materials, Chemistry section 
for details). The screening revealed the existence 
of several selective enzyme preparations, such 
as immobilized acylases from Achromobacter 
sp. (Fermase PS150 and PS750), which displayed 
high selectivity toward hydrolysis of the Al- 


and B29-amides, yielding mono-protected 
Bl-phenylacetylinsulin 3a (Fig. 2). Yet, the 
majority of active acylases, such as the PGA 
from Kluyvera cryocrescens (KcPGA), yielded 
complex mixtures of all possible mono and 
di-(phenylacetyl) products 3a to 3f (fig. $52), 
before converging at fully deprotected insu- 
lin 1 upon prolonged incubation (fig. S55). A 
time course study revealed that the KcPGA 
enzyme displayed different hydrolysis rates 
of phenylacetamide groups of 2a: B29 > Al > 
Bl. Encouraged by the existing hierarchy in 
hydrolysis, we envisaged that the regioselec- 
tivity conferred by this acylase could be 
modulated with directed evolution, providing 
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enzymes that could catalyze site-selective hy- 
drolysis. We decided to test this hypothesis by 
engineering KcPGA to identify variants spe- 
cific for B29-lysine (Fig. 2). 

We introduced changes to the KcPGA con- 
struct to simplify recombinant expression 
and directed evolution in E. coli as the host. 
PGAs are periplasmic heterodimeric enzymes 
composed of two subunits, which require an 
autoproteolytic processing step to form an ac- 
tive enzyme [fig. S1A (27)]. Consequently, the 
signal peptide and the 54 amino acid inter- 
domain region were removed, and a short linker 
between the B and a domains was added, 
similar to a previous report for the EcPGA 
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Fig. 1. Strategies for site-selective insulin functionalization. (A) Structure (left) and sequence (right) of human insulin with highlighted free amino groups at Al (glycine, 
G*), Bl (phenylalanine, F%), and B29 positions (N®-lysine, K®), which can be functionalized through a pK,-based paradigm [pK, values based on (21)]. The disulfide bonds are 
marked in dark gray. (B) A general strategy for well-defined insulin conjugation using biocatalytic protecting group strategy. (C) Insulin conjugate drug candidates (19, 20). 
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homolog [fig. SIB (28)]. In this way, we con- 
structed a single-chain enzyme, referred to 
as PGA®®° where the catalytic residue serine 
is positioned at the N terminus, which obviated 
the need for its posttranslational processing 
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lytical assay in place (supplementary materials 
section 5), we focused on improving the selec- 
tivity for hydrolysis of tri(phenylacetyl)insulin 
2a at the B29 position (Fig. 2). After four rounds 
of enzyme evolution and high-throughput 
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Fig. 2. Generation of phenylacetyl insulin toolbox. Enzymatic synthesis of (phenylacetyl)insulins 3a to 3f through hydrolysis of (triphenylacetyl)insulin 2a or 
direct acylation of insulin 1. All yields are reported as isolated yields. 90.78 mM 2a, Fermase PS150 beads (PGAajg29-4), 100 mM Tris*HCl, pH 8.0, 10 vo I% MeCN, 
30°C, 2 hours. °6.4 mM 2a, 5.7 wt % PGAgzo.n°°°, water, pH 8.5, 30°C, 20 hours. °7.27 mM 2a, 2.8 wt % PGAgzu", 100 mM triethanolamine, pH 8.0, 30°C, 22 hours. 
°7.91 mM 1, 20 equiv PhAcOMe, 2.25 wt % PGAgzac%“2C4*, 20 vol % MeCN in water, pH 8.35, 27.5°C, 6 hours. °8.0 mM 1, 20 equiv PhAcOMe, 5 wt% PGAgz9.4¢°47 CH, 
20 vol % MeCN in water, pH 9.25, 30°C, 3 hours. '8.0 mM 1, 20 equiv PhAcOMe, 5 wt % PGAwzaz9-ace Hs, 20 vol % MeCN in water, pH 8.85, 27.5°C, 1.5 hours. 


Table 1. Properties and performance of evolved acylases used in regioselective insulin functionalization compared to the starting (parent) variant 


(assays performed at 1- to 50-ml scale, with no pH control). The detailed phylogenetic tree is presented in fig. S3. 
STARTING ENZYME EVOLVED ENZYME 
Evolution focus/ Parent Conv. Enz:  Substr. | Top-performing Conv. Enzy:  Substr. | Amino acids 
Reaction variant (select.)  (%)> (mM)° variant (select)  (%) (mM)° changed (no.) 
PGAR® 21%! 10 0.8 PGA g2o-9 44 99%! 1 8.2 16 
36%) 99%) (2.3%) 
PGAR# 32%° 25 5.1 PGA arn ®¢9 85% 2.5 8.2 13 
83%) 96%) (1.7%) 
PGA gtobar-H 48%! 25 6.9 PGA aieac 4-H 92%E 0.8 8.6 15 
97%) 98%) (2.0%) 
PGA g29-9 ®t? 0.5%" 12 43 PGA paa-ach Go CHs 64%! 2.5 12.9 26 
nd) 92%) (3.4%) 
A1,B29-acylation PGA poo. CHS 88! 4 8.6 PGA a1az9-ac®> Hs 99.5k 5 8.6 27 
lto 3f 9%) 37%) (3.5%) 
Global hydrolysis PGAR«O 12%! 16 0.8 PGA giobar- 42 Hs >99%! 16 0.8 8 
(monomer) (1.0%) 
3tol 
Global hydrolysis PGA gtobar-y RS CHis 34%™ 2.5 2 PGA giobal-dimer-H 2° CHS | 93%™ 5 2.5 20 
(dimer) (2.6%) 
4a or 4b to 5a 


*Number of mutations compared to PGA®*°. enzyme variants are color-coded based on their lineage in the phylogenetic tree (fig. S3). Crystal structures and computational models of top 
evolutionary variants are presented in supplementary materials. Enzyme loading refers to the mass of lyophilized clarified cell lysate relative to the mass of the reaction substrate. Results may 
reflect improvements in enzyme expression as well as activity. “Starting insulin or insulin derivative concentration. “2a, 200 mM Tris*HCl, pH 8.0, 30°C, 24 hours. °2a, 200 mM Tris HCl, pH 8.0, 
30°C, 5 hours. '1, 97 equiv PhAcOMe, 100 mM Tris*HCl, pH 8.5, 20 vol % MeCN, 30°C, 2 hours. ® 1, 77 equiv PhAcOMe, 250 mM Tris*HCl, pH 9.25, 20 vol % MeCN, 30°C, 2 hours. "1, 26 equiv 
PhAcOMe, 100 mM Tris*HCl, pH 9.25, 20 vol % MeCN, 30°C, 3 hours. '1, 9 equiv PhAcOMe, 200 mM Tris*HCl, pH 9.25, 20 vol % MeCN, 30°C, 3 hours. !1, 14 equiv PhAcOMe, pH 9.25, 20 vol % 
MeCN, 30°C, 1 hour. “1, 20 equiv PhAcOMe, pH 9, 20 vol % MeCN, 27.5°C, 1 hour, data obtained in a high-throughput screening assay with no pH control. If pH was maintained throughout the 
reaction, 89% selectivity for 3f was achieved (see Fig. 2). '2a, 100 mM Tris*HCl, pH 8.0, 30°C, 22 hours. "4a, 250 mM triethanolamine, 30°C, 24 hours. Conversion and selectivity were 


determined using the following equations: conversion (%) = 
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Are (desired product3x)/ LATE (all products1+ 2a+ 3a-t) * 100%, selectivity (%) = Area (desired product3x)/ LATE (al) products3a-ty * 100%. 
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screening, we identified mutant PGA po9-77°* 


(Table 1), which displayed improved activ- 
ity and 99% selectivity, providing access to 
A1,B1-di(phenylacetyl)insulin 3b in excellent 
yield and purity (Table 1; see supplementary 
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materials section 5 for details). Encouraged by 
this result, we began a divergent evolution 
program targeting Al-selective hydrolysis. 
Two rounds of evolution led to identification of 
PGA 4.77? with 95% selectivity, which enabled 
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the synthesis of B1,B29-di(phenylacetyl)insulin 
3c (Table 1; see supplementary materials sec- 
tion 5 for details). 

Although the above approach successfully 
afforded two of the desired insulin conjugates, 
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Fig. 3. Synthesis of B29,B29’ insulin dimers. (A) MK-1092 in 73% yield over two steps from 3d; (B) MK-5160 in 66% yield over three steps from 3e. 


Table 2. Activity and selectivity of evolved acylating PGA variants used in functionalization of native proteins. nd, — not determined. 


NUCLEOPHILE SEQUENCE ENZYME CONVERSION SELECTIVITY ? 
(modified site)? 
Insulin a —EEE PGA ya yitoHs 70% (Al) >99% 
-chain =GIVE! rs -_YQLENYC HAC 5 
(1) sich area TSICSLYQLE om | PGA, Ris 74% (B29), 26% (A1B29) 74% 
pl 5.4 B-chain H,N=F VNQHLCGSHLVEALYLVCGERGFFYTPKT PGA, oa 7% (Al), 36% (B29), 54% 55% 
sity (A1B29), 2% (A1B1B29) 

Glargine j —— PGAy a 75% (Al) >99% 

(9) A-chainH,N=GIVEQ . PGA aro schoo 32% (Al) >99% 

pl 67 B-chainH,N=F VN QH LCGSHLVEALYLVCGERGFFYTPKTRR PGA iesoag 65% (Al) >99% 

NH 

Glucagon lines aoTeTspyskyLosRRAaoFVawumNT ada 14% (HD) 99% 

(10) : NH, PGA gag, eo, trace nd 

pl 6.75 PGA sissies trace nd 

Melittin 7 PGA yah Hs 96% (G1) >99% 

(11) ; m= Cn PGA rn aeSt®-oH 33% (Gl) >99% 

pl 47° PGA pro acts 54% (G1) >99% 

Apamin PGA rn trace nd 

(12) PGA 394 oo trace nd 
15.50 HN PG Ae Rd2A9-CHis d 

p!5.5 me arn trace n 

Somatostatin PGA pra RCH 2% (Al), 1% (KA), 1% (K9) nd 

(13) HNtAGCKNFFWKTFTSC PGA. 6 Hs 4% (Al), 0.5% (KA), 3% (K9) nd 

pl 5.6° NHp NH, PGA aro acti2 on 1% (AL), 1% (K4), 5% (K9); nd 


*Reactions were conducted at 10 to 20 mg/ml peptide concentration in 100 mM triethanolamine buffer (pH 8.5) at 5 wt% enzyme loading and analyzed after 6 hours. Conversion and selectivity 
were determined using the following equations: conversion (%) = Areaacytated producty/ZATEAall acylated products + native protein) * 100%; selectivity (%) = Area (acylated producty/ZATEA ai! acylated products) * 
100%. Site selectivity was determined by top-down MS analysis. °Calculated using Expasy Compute pl tool (https://web.expasy.org/compute_pi/). 
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we recognized that the selective deprotection 
of phenylacetyl groups requires chemical syn- 
thesis of tri(phenylacetyl)insulin 2a, adding 
steps. By contrast, a direct selective installa- 
tion of phenylacetyl groups on insulin 1 (Fig. 1B 
and 2) could offer a streamlined route to pro- 
tected insulins 3. PGA-catalyzed amide hydrol- 
ysis is a reversible process that has also been 
applied in amide synthesis by thermodynami- 
cally (29) or kinetically (30) controlled pro- 
cesses in aqueous media (27). Previous reports 
have shown that protein engineering can be 
used to increase the rate of enzymatic acyl 
transfer from phenylacetyl esters to amines 
(V,) to outcompete the rate of the ester hy- 
drolysis (V;,) (31, 32). We tested the kinet- 
ically driven site-selective acylation of insulin 
using methyl phenylacetate (PhAcOMe) as a 
donor (Fig. 2) to access the remaining de- 
rivatives 3d to 3f and complete the build of 
our “insulin toolbox.” Variants from early 
rounds of PGA evolution catalyzed the for- 
mation of Al-(phenylacetyl)insulin 3d using 
an excess of methyl phenylacetate at pH 8.0 
with acetonitrile as a cosolvent (20 vol %) with 
high (97%) selectivity, albeit low conversion 
(Table 1; see supplementary materials sec- 
tion 5 for evolutionary details). Two subsequent 
rounds of evolutionary screening yielded the 
PGA4yae variant, which catalyzed the for- 
mation of Al-(phenylacetyl)insulin 3d by 
direct acylation of insulin 1 in 92% yield and 
98% selectivity (Table 1 and supplementary 
materials section 5). At this point, to facil- 
itate large-scale process development, as well 
as the structural studies, we introduced a 
hexa-histidine tag at the C terminus of the 
acylase (CHis) to provide the PGA 47.4.0?" 
variant. 

Encouraged by these results, our protein 
engineering efforts next targeted the evolu- 
tion of the acylases to prepare B29-protected 
and A1,B29-di-protected peptides 3e and 3f. 
The B29-selective variant PGApoo-7/***? be- 
came the evolutionary starting point (see fig. 
83, evolutionary tree). The initial studies high- 
lighted the importance of pH in achieving a 
high V;/V;, ratio for kinetically controlled 
processes (33). The higher pK, of e-terminal 
amino group of lysine (pK, 11.1°%"° versus 
7,yPhenylalanine Fig 1A) required increasing the 
reaction pH to 9.25 to achieve >95% conver- 
sion to B29-amide. Using these conditions, 
directed evolution campaigns enabled the 
identification of two variants: PGApog.4e 
for B29-acylation and PGA 47p99.4¢8 Cs for 
A1,B29-diacylation (Table 1 and fig. S3, evolu- 
tionary tree). 

With five engineered variants of PGA from 
K. cryocrescens and the complementary activity 
of the PGA from Achromobacter sp., we gained 
access to all six combinations of protected 
mono- and di(phenylacetyl)insulin analogs 
(8a to 3f, Fig. 2) for site-selective conjugation. 
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Our protecting-group strategy additionally 
required a complementary enzyme to catalyze 
a robust and highly selective deprotection of 
protected insulins (Fig. 1C). Additional rounds 
of directed evolution of the parent PGA with 
a focus on improving hydrolysis at all three 
positions of tri(phenylacetyl)insulin 2a yielded 
variant PGAgiosar#” with fourfold higher ac- 
tivity on the monomeric insulin (Table 1). 

Our work highlights the flexibility and effi- 
ciency with which protein engineering can 
alter the specificity of a promiscuous parent 
acylase PGARA, whose native substrate is a 
small-molecule 6-aminopenicillanic acid, and 
enable site-selective activity on non-native sub- 
strates such as insulin 1 and its derivatives 2 to 
4. The six divergent directed evolution cam- 
paigns demonstrate that we can successfully 
tune both regioselectivity and kinetic rates 
of enzymatic transformations, i.e., V,/V;, ratio 
of hydrolysis versus acylation. The final evolu- 
tionary variants required the incorporation 
of 11 to 26 amino acid mutations across the 
entire protein length (from 1.4 to 3.4% of a 
764-amino acid single peptide, Table 1 and 
supplementary materials sections S4 and S5). 
To gain insights into how the specific muta- 
tions affect the selectivity and activity of the 
top-performing variants, we acquired crystal 
structures of the variants PGAgioparz 
(PDB ID 7reo, 1.81 A) and PGAyp ach Hs 
(PDB ID 7rep, 2.19 A), and for the other top- 
performing PGA variants we created compu- 
tational models (Fig. 2B and figs. S9 to S20) 
The crystal structures closely match that of the 
heterodimeric K. cryocrescens PGA [PDB ID: 
4pem (34), figs. S5 to S8]. Many mutations 
were in the flexible loops next to the active 
site and on the protein surface. The mutations 
introduced in the binding pocket and within 
10 A of its entrance are most likely critical 
for insulin recognition (Fig. 2B), whereas the 
changes that are remote from the active site 
might play a role in improving the overall 
enzyme fitness, such as stability in the pres- 
ence of organic cosolvents. Some mutations, 
including silent DNA mutations, contributed 
toward the improvements in soluble protein 
expression in E. coli, which lowered the en- 
zyme load to achieve scalable biotransforma- 
tions (see supplementary materials, protein 
engineering and directed evolution section for 
details). Bioinformatic analysis of the gene 
sequence-function data of >200,000 variants 
screened across all six evolution programs high- 
lighted the critical role of F7O1W and N2L 
mutations in the active site (fig. S8) to achieve 
Al selectivity (see supplementary materials 
section 3 for a detailed discussion); however, 
the exact role of many other mutations was 
difficult to rationalize. 

After the development of the insulin toolbox 
(Fig. 2 and Table 1), we explored the appli- 
cation and generality of this protecting-group 
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strategy in the synthesis of our two clinical 
candidates MK-1092 (5a) and MK-5160 (5b) 
(Fig. 3) (19). These conjugates are insulin 
dimers stapled at the B29,B29’ positions with 
either a trans-1,4-cyclohexyldicarboxyamide or 
a suberoyl amide linker, respectively. Protected 
insulins 3 could provide a well-defined chem- 
ical dimerization and obviate the chromato- 
graphic purification at a large scale, facilitating 
process development, as well as further ex- 
ploration of any structural insulin analog of 
interest. 

We first studied the synthesis of MK-1092 
through dimerization of A1,B1-di(phenylace- 
tyl)insulin 3b, followed by global hydrolysis 
of phenylacetamides catalyzed by deprotect- 
ing variant PGAgiopa-H? “""* (scheme $22). 
Initial trials showed clean, yet slow, enzymatic 
phenylacetamide deprotection, requiring 
>3 days to reach completion. A single round 
of evolution provided PGAgiopardimer-H 
with 2.5-fold improved activity. With this var- 
iant, both reactions proceeded in high con- 
version to afford MK-1092 5a in 93% purity, 
compared to <50% purity of the direct dimer- 
ization driven by differences in the pK, of 
B29-lysine (35). Despite the improvements 
in product purity, this bioconjugation route 
suffered from a relatively high overall step 
count (four steps from insulin 1 through chem- 
ically prepared tri(phenylacetyl)insulin 2a). A 
more efficient approach would be the direct 
dimerization of Al-(phenylacetyl)insulin 3d 
(Fig. 3A). The derivative 3d could be prepared 
from insulin 1 in one step with high selectivity 
that enabled its isolation by precipitation in 
93% yield (Fig. 2A) and obviated the need for 
its chromatographic purification. Notably, the 
conjugation of 3d with N-hydroxysuccinimide 
(NHS)-ester 6a proceeded with high selectivity 
to form the desired A1,A1’-dimer 4b (Fig. 3A). 
We hypothesize that the phenylacetamide on 
Al is reducing surface accessibility of the un- 
protected N-terminal B1-phenylalanine residue, 
which did not participate in conjugation. Once 
dimerization was complete, the reaction stream 
could simply be diluted with water and sub- 
jected to the PGA-catalyzed deprotection. To 
facilitate large-scale delivery of MK-1092, we 
further engineered the deprotection enzyme 
to act on the dimer 4a. Two evolutionary 
rounds afforded PGAgiobatdimer-tt  s Which 
displayed a 2.5-fold increase in activity (Table 1; 
see supplementary materials section 10 for 
details). Using PGAgiopatdimer-H#  Vari- 
ant, we generated >120 g of MK-1092 in 82% 
purity and 73% isolated yield over just two 
steps. This two-pot synthesis substantially 
improved the yield as compared to the direct 
conjugation used in the initial chemistry route 
(19, 36). 

The synthesis of MK-5160 5b required a dif- 
ferent protected insulin to selectively in- 
stall the carbamoyl groups capping both Al 
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and B1 N-terminal amines. The pK,-driven 
strategy proved challenging to selectively 
make insulin derivative 8 [<70% purity, 
(19, 36)]. Consequently, we subjected B29- 
(phenylacetyl)insulin 3e, synthesized from 
insulin 1 using PGApo9.4°0" "in acylation, 
to the chemical carbamoylation using potas- 
sium isocyanate under aqueous conditions to 
give derivative 7 (Fig. 3B). The subsequent 
enzymatic deprotection step was performed 
in a single pot without intermediate isola- 
tion to provide A1,B1-(dicarbamoyl)insulin 8 
in 90% yield and 92% purity over the two 
steps. The dimerization with suberoyl NHS- 
ester (6b) afforded the target molecule MK- 
5160 5b an overall 86% purity and 66% yield 
from B29-(phenylacetyl)insulin 3e, twofold 
higher in yield compared to the initial route 
(19). Synthesis of the two clinical candidates 
MK-1092 and MK-5160 demonstrates the value 
of our chemoenzymatic strategy to provide 
homogeneous protein conjugates at scale. 

The enzymatic toolbox enables a simple, 
modular approach to the synthesis of struc- 
turally diverse insulin bioconjugates, allowing 
for efficient exploration of structure-activity 
relationships (37). The engineered PGAs can 
also accept other acyl donors such as phenoxy- 
or (4-ethynylphenyl)acetates (see supplemen- 
tary materials section 10 and table S11), which 
can provide functional handles for further 
structural diversification, i.e., click chemistry. 
In addition, this strategy could be extended 
to the site-selective bioconjugation of other 
native proteins as well. Accordingly, we tested 
five other small peptides with the three acylating 
PGA variants (PGA 47.4¢°? FS, PGA pog 4 ROS, 
and PGA 47799-4087 ""'®) (Table 2; see sup- 
plementary materials section 3.4 for details). 
For our studies, we selected insulin homolog 
glargine (9); two linear single-chain peptides, 
glucagon (10) and melittin (11); and two single- 
chain cyclic peptides containing disulfide 
bridges, apamin (12) and somatostatin (13), 
while insulin (1) was used as a control (scheme 
$21 and fig. S26). 

Whereas the acylating PGAs were evolved 
for their activity and selectivity toward insulin 
(1), the enzymes could modify other peptides 
(9 to 13), albeit with lower activity and dif- 
ferent site selectivity. For example, insulin- 
glargine (9) was acylated exclusively at Al 
position by all three tested enzymes. Com- 
pared to human insulin (1), glargine contains 
two additional C-terminal arginines at B-chain, 
R31, and R32, and this change in the amino 
acid context was sufficient to completely abol- 
ish the enzymatic activity on y-amine of the 
adjacent lysine B29. Similarly, the two linear 
peptides were only functionalized at their 
N termini: glycine of melittin 9 (33 to 96% 
conversion) and histidine of glucagon 10 (14% 
conversion with PGA 47.4.°7?“' only). No or 
trace activity was noted with internal lysine 
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residues. With cyclic peptides, apamin (13) 
proved least amenable to modifications with 
trace activity. By contrast, PGAs modified all 
three free amines of somatostatin (12), albeit 
with low conversions and varying site regio- 
selectivity. The differences in reactivity among 
these peptides are challenging to rationalize 
or predict. Three-dimensional structure of the 
peptide substrates, steric and electronic con- 
text of the adjacent amino acids, character 
and pK, of the modified residue: All of these 
factors undoubtedly play a role in site selec- 
tivity and protein-protein recognition. Iso- 
electric point pI and solubility of the substrate 
peptide under given pH may also contribute 
to the observed selectivities and efficiency 
of the modifications. Future efforts to obtain 
structural data could advance our understand- 
ing of peptide-enzyme recognition, enabling 
the rapid construction of structurally diverse 
peptide conjugates. 

Our biocatalytic protecting-group strategy 
has provided well-defined, scalable ligation of 
insulin amines and enabled the construction 
of structurally complex bioconjugates of native 
peptides 5a and 5b with efficiency far ex- 
ceeding that of existing synthetic approaches 
(5, 19). The development of our enzymatic 
toolbox demonstrated the distinctive ability 
of enzyme catalysis to differentiate among 
amino acid residues in peptide substrates, 
which otherwise display similar chemical 
reactivity. Directed evolution remains a key 
tool to reprogram enzymes to perform site- 
specific conjugation on a peptide of interest, 
providing a control strategy previously un- 
achievable with traditional methods for liga- 
tion of native proteins or other large molecules. 
We expect that bioconjugation strategies using 
naturally occurring and engineered enzymes to 
act on native proteins and peptides will con- 
tinue to grow (13-15), accelerating the discov- 
ery and development of diagnostic probes and 
therapeutics (7-4). Progress in structural elu- 
cidation, as well as emerging machine learning 
approaches to predict the structure of inter- 
acting protein chains and complexes (38-41), 
will play a key role in the protein engineering 
and bioconjugation fields. 
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Analysis of 6.4 million SARS-CoV-2 genomes 
identifies mutations associated with fitness 
Fritz Obermeyer'?*+, Martin Jankowiak'”, Nikolaos Barkas?, Stephen F. Schaffner’*, 


Jesse D. Pyle’+, Leonid Yurkovetskiy®, Matteo Bosso®, Daniel J. Park’, Mehrtash Babadi?, 
Bronwyn L. Macinnis'*®, Jeremy Luban*>°”, Pardis C. Sabeti-?+°%s, Jacob E. Lemieux>®?*§ 


Repeated emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants 
with increased fitness underscores the value of rapid detection and characterization of new lineages. 
We have developed PyRo, a hierarchical Bayesian multinomial logistic regression model that infers 
relative prevalence of all viral lineages across geographic regions, detects lineages increasing in 
prevalence, and identifies mutations relevant to fitness. Applying PyRo to all publicly available 
SARS-CoV-2 genomes, we identify numerous substitutions that increase fitness, including previously 
identified spike mutations and many nonspike mutations within the nucleocapsid and nonstructural 
proteins. PyRo forecasts growth of new lineages from their mutational profile, ranks the fitness of lineages 
as new sequences become available, and prioritizes mutations of biological and public health concern 


for functional characterization. 


he severe acute respiratory syndrome 

coronavirus 2 (SARS-CoV-2) pandemic 

has been characterized by repeated waves 

of cases driven by the emergence of new 

lineages with higher fitness, in which 
fitness encompasses any trait that affects the 
lineage’s growth, including its basic repro- 
duction number (Ro), ability to evade existing 
immunity, and generation time. Rapid iden- 
tification of such lineages as they emerge 
along with accurate forecasting of their dy- 
namics is critical for guiding outbreak response. 
The ability to interrogate the entirety of the 
global SARS-CoV-2 genomic dataset would 
be greatly beneficial in doing this effectively. 
The large size (currently >7.5 million virus ge- 
nomes) and geographic and temporal variability 
of the available data present considerable chal- 
lenges that will become greater as more viruses 
are sequenced. Current phylogenetic approaches 
are computationally inefficient on datasets with 
more than ~5000 samples and take days to run 
at that scale. Ad hoc methods to estimate the 
relative fitness of particular SARS-CoV-2 lineages 
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are a computationally efficient alternative (/-3) 
but have typically relied on models in which 
one or two lineages of interest are compared 
with all others and do not capture the complex 
dynamics of multiple cocirculating lineages. 

Furthermore, estimates of relative fitness 
based on lineage frequency data alone (2-5), 
which can be extended to multiple lineages 
(1, 3-5), do not take advantage of additional 
statistical power that can be gained from 
analyzing the independent appearance and 
growth of the same mutation in multiple line- 
ages. Performing a mutation-based analysis of 
lineage prevalence has the additional advan- 
tage of identifying specific genetic determi- 
nants of a lineage’s phenotype, which is critical 
both for understanding the biology of trans- 
mission and pathogenesis and for predicting 
the phenotype of new lineages. The SARS- 
CoV-2 pandemic has already been dominated 
by several genetic changes of functional and 
epidemiological importance, including the 
spike (S) D614G mutation associated with 
higher SARS-CoV-2 loads (6, 7). Mutations 
found in Variants of Concern (VoC), such as 
S:N439R, S:N501Y, and S:E484K, have been 
linked to increased transmissibility (8), enhanced 
binding to ACE2 (9), and antibody escape, 
respectively (10, 11). Despite these successes 
identifying functionally important mutations 
in the context of a large background of genetic 
variants of little or no phenotypic consequence 
remains challenging. 

In modeling the relative fitness of SARS- 
CoV-2 lineages, we estimated their growth as 
a linear combination of the effects of individ- 
ual mutations. To this end we developed PyRo, 
a hierarchical Bayesian regression model that 
enables scalable analysis of the complete set of 
publicly available SARS-CoV-2 genomes, which 
can be applied to any viral genomic dataset and 
to other viral phenotypes. The model—which is 
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summarized in fig. S1 and described in detail in 
the supplementary materials—avoids the com- 
plexity of full phylogenetic inference by first 
clustering genomes by genetic similarity (refin- 
ing PANGO lineages) (12) and estimating the 
incremental effect on growth rate of each of 
the most common amino acid changes on the 
lineages in which they appear. By regressing 
growth rate on genome sequence, the model 
shares statistical strength among genetically 
similar lineages without explicitly relying on 
phylogeny. By modeling only the multinomial 
proportion of different lineages rather than the 
absolute number of samples for each lineage 
(13, 14) and by doing so within 14-day intervals 
in 1560 globally-distributed geographic regions, 
the model achieves robustness to a number of 
sources of bias that affect all lineages across 
regions and over time, including differences in 
data collection and changes in transmission 
due to factors such as social behavior, public 
health policy, and vaccination. 

We fit PyRo to 6,466,300 SARS-CoV-2 genomes 
available on GISAID (15) as of 20 January 2022, 
in a model that contained 3000 clusters derived 
from 1544 PANGO lineages and 2904 non- 
synonymous mutations. The output of the model 
is a posterior distribution for the relative fit- 
ness (exponential growth rate) of each lineage 
and for the contribution to the fitness from 
each mutation. Fitting this large model is com- 
putationally challenging, so we used stochastic 
variational inference—an approximate infer- 
ence method that reduced our task to solving 
an optimization problem with 75 million di- 
mensions on a GPU. Inference was implemented 
in the Pyro (6) probabilistic programming 
framework (see supplemental materials). The 
trained model can be used to infer lineage 
fitness, predict the fitness of completely new 
lineages, forecast future lineage proportions, 
and estimate the effects of individual muta- 
tions on fitness. 

The model’s lineage fitness estimates (Fig. 
1) show a modest upward trend over time 
among all lineages, interrupted by several 
lineages with much higher fitness. Sensitiv- 
ity analyses revealed qualitative consistency 
of fitness estimates across spatial data sub- 
sets (fig. S2). The upward trend may in part 
reflect an upward bias caused by the lineage 
assignment process as can be seen in sim- 
ulation studies (fig. $3); however, the high tail 
of the distribution exhibits elevated fitness 
values far in excess of this trend. The spread 
of the virus into human populations in late 
2019 and early 2022 was marked by periods 
of rapid evolution in fitness and waves of in- 
creased case counts (Fig. 1). Although PANGO 
lineages facilitate communication by provid- 
ing a stable nomenclature, we observed some 
PANGO lineages with multiple successive peaks 
in some regions, suggesting that sublineages 
within them had differing levels of fitness. We 
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therefore algorithmically refined the 1544 
PANGO lineages into 3000 finer clusters and 
found that our model identified substantial 
heterogeneity within some PANGO lineages 
(fig. S4). When we tested the model’s predic- 
tive ability (fig. S5), we found that forecasts 
were reliable for 1 to 2 months into the future 
for variants of concern—but not necessarily 
other variants—when they tended to be dis- 
rupted by the emergence of a completely new 
strain (table S1 and fig. S6). The accuracy of 
forecasts that stabilized typically did so within 
two weeks after the emergence of a new com- 
petitive lineage in a region (fig. S6). 

The model correctly infers World Health 
Organization classification variant Omicron 
(PANGO BA.2) to have the highest fitness to 
date: 8.9 times [95% confidence interval (CI) 
8.6 to 9.2] higher than the original A lineage 
(Fig. 1, inset), accurately foreshadowing its 
rise in regions where it is circulating (fig. S7). 
Through systematic backtesting we found 
that the model would have provided early 
warning and aided in the identification of 
VoCs had it been routinely applied to SARS- 
CoV-2 samples, confirming its utility for public 
health and underscoring the value of rapid 
sharing of genomic data. For example, the ele- 
vated fitness of BA.2 was identified by mid- 
December 2021 on the basis of 76 reported 
sequences (fig. S8); sharing statistical strength 


over mutations enabled an earlier and more 
confident prediction that BA.2 was the fittest 
lineage yet observed (fig. S14). Likewise, PyRo 
would have forecast the dominance of B.1.1.7 in 
late November 2020 (fig. S9), AY.4: by May 2021 
(fig. S10), and BA.1 by early December 2021 (fig. 
88). Although variant-specific models were ac- 
curate and useful in predicting the rise of these 
lineages (2), each modeling effort was specific 
to a particular lineage and geographic region. 
PyRg’s global approach provides similar early 
detection while also offering automated, rapid, 
and standardized consideration of all variants 
and lineages together with ranking based on 
relative fitness. 

Compared to standard multinomial regres- 
sion models, PyRg estimates of lineage fitness 
were similar (Pearson’s R = 0.95, figs. S11 and 
$12), but inclusion of mutations in the model 
enables PyRo to infer elevated fitness of Omicron 
lineages BA.1 and BA.2 faster than the model 
without mutations (fig. S14). In contrast to 
nonhierarchical binomial logistic regression 
(fig. S13), PyRp estimates displayed less varia- 
bility as data accumulated, benefitting from 
the sharing of information across regions and 
the regularizing effect of the prior probability 
distributions for the model coefficients. Line- 
age fitness estimates were also stable between 
our initial analysis of 2.1 million genomes in 
August 2021 (17), shortly after the emergence 
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Table 1. Amino acid substitutions most significantly associated with increased fitness. 
Significance is defined as posterior mean or posterior standard deviation. Fitness is per 5.5 days 
(estimated generation time of the Wuhan (A) lineage) (1, 23). The final column is the number of 
PANGO lineages in which each substitution emerged independently. 
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of the Delta lineages and before the emergence 
of Omicron (Spearman’s rho = 0.78; fig. SI5C). 
The correlation between individual amino acids 
in the two models was weaker than that for 
lineages (fig. S15, D and E; rho = 0.48) but still 
significant (test of no association for rho, P < 
2 x 107"), reflecting both the inherent difficulty 
of estimating high-dimensional mutational co- 
efficients observed indirectly through lineage 
counts (supplementary note 1) as well as the 
addition of 4.3 million sequences, including 
highly fit Omicron lineages distinguished by 
their enhanced immune escape. 

By jointly modeling fitness estimates with 
lineage counts and individual mutations PyRg 
harnesses convergent evolution (Table 1 and 
fig. S16) to infer the fitness of new constella- 
tions of mutations based on the trajectories of 
other lineages in which they have previously 
emerged. This predictive capability has the 
potential to aid public health efforts because 
the model has the potential to learn faster by 
incorporating mutations than it would by 
relying on lineage counts alone (fig. S14). To 
test the reliability of this kind of estimate we 
fit leave-one-out estimators for PANGO line- 
ages on subsets of the dataset with that entire 
lineage removed, based solely on the muta- 
tional content of the omitted lineage (fig. S17). 
These estimators showed excellent agreement 
with estimators based on the observed behav- 
ior of the lineages; they were also more accu- 
rate than naive phylogenetic estimators that 
assume that the fitness of each new strain is 
equal to its parent lineage’s fitness (Pearson’s 
R = 0.983 after correcting for parent fitness; 
fig. S17). Together these analyses suggest that 
PyRo has the potential to aid genomic surveil- 
lance efforts by providing an automated early 
warning system on a time scale similar to that 
of sophisticated regional surveillance efforts 
(78, 19). 

Genome-wide estimates of the effect of SARS- 
CoV-2 mutations on fitness also provide a pow- 
erful tool for better understanding the biology 
of fitness. Our model allowed us to estimate 
the contribution of 2904 amino acid substitu- 
tions (Fig. 2A and Table 1) to lineage fitness 
and to rank them by inferred statistical signif- 
icance (fig. S18). Cross-validation confirmed 
that these results replicate qualitatively across 
different geographic regions (fig. S19). 

The highest concentrations of fitness- 
associated mutations were found in the S, N, M, 
and ORF! polyprotein genes (ORFla and ORFIb; 
Fig. 2, A and B, and figs. S20 and S21). 
Using spatial autocorrelation as a measure of 
spatial structure, we found evidence of functional 
hotspots in the S, N, ORF7a, ORF3a, and ORFla 
genes (table S2). Within S, we confirmed three 
hotspots of fitness-enhancing mutations, each 
within a defined functional region: the N- 
terminal domain, the receptor-binding domain 
(RBD), and the furin-cleavage site (Fig. 2B). We 
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Fig. 1. Relative fitness versus date of lineage emergence. Circle size is proportional to cumulative case count inferred from lineage proportion estimates and 
confirmed case counts. The inset table lists the 10 fittest lineages inferred by the model. R/Ra is the fold increase in relative fitness over the Wuhan (A) lineage, 


assuming a fixed generation time of 5.5 days. 


assessed mutational enrichment in the top- 
ranked set of mutations and identified enrich- 
ment for lysine to asparagine mutations in the 
S gene (fig. S22C). We visualized top-scoring 
mutations within atomic structures for the 
spike protein (Fig. 2, D to E), the nucleocapsid’s 
N-terminal domain (Fig. 2F), the polymerase 
(fig. S23), and two proteases (fig. S24). Many of 
the top mutations in the S gene occurred in the 
receptor binding domain (RBD) that makes 
direct contact with the ACE2 receptor, includ- 
ing K417N/T and E484K (Fig. 2, D to E). Two 
top-ranked mutations, T478K and S477N, occur 
in a flexible loop adjacent to the S-ACE2 inter- 
face (Fig. 2E), suggesting that these mutations 
may affect the kinetics of receptor engage- 
ment or the Spike conformational changes 
that follow. Other mutations occurred in 
regions proximal to essential enzymatic active 
sites of viral replication (fig. S23) or protein 
processing (fig. S24) machinery. 

We tested several of the high-scoring muta- 
tions in single-cycle infectivity assays as per- 
formed previously (7), focusing on the RBD 
(Fig. 3A). We found that although some in- 
dividual mutations increased infectivity, on 


Obermeyer et al., Science 3'76, 1327-1332 (2022) 


average high-scoring RBD mutations did not 
promote infectivity per se. We considered an 
alternate possibility that fitness of Spike mu- 
tations is driven by immune escape. Using 
RBD-aggregated mutations as a proxy for im- 
mune escape we found that the fitness effect of 
these Spike mutations correlates well with 
antibody escape estimates from Greaney et al. 
(20) (Fig. 3B). Together with the observed 
jump in fitness beginning in late 2021 (Fig. 3C) 
associated with Spike mutations—but not mu- 
tations elsewhere in the genome (Fig. 3E)— 
these results suggest that immune escape is the 
dominant driver of current fitness increases. 
BA.1 and BA.2 had similar estimated fitness 
from Spike mutations—potentially consistent 
with similar Spike antibody neutralization of 
these variants (27)—whereas PyRo inferred that 
the elevated fitness of BA.2 is attributed to non- 
Spike mutations (fig. S25). In contrast to muta- 
tions in Spike, those in the serine-arginine rich 
region of N were linked to increased efficiency 
of SARS-CoV-2 genomic RNA packaging (22). 
Within ORF1 we found fitness-associated mu- 
tations across all viral enzymes and clusters 
within additional nonstructural proteins (nsps). 
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The highest concentration of fitness-associated 
mutations is found in nsp4, nsp6, and nsps12 
to 14 (figs. S20B and S21, C and D), suggesting 
unexplored function at those sites. For exam- 
ple, nsp4 and nsp6 have roles in assembly of 
replication compartments and substitutions 
in these regions may influence the kinetics of 
replication (see supplemental note 3). We cau- 
tion that although convergent evolution makes 
it possible to identify candidate functional mu- 
tations, observational data alone is insufficient 
to declare mutations as causal rather than 
merely correlated. Our uncertainty-ranked list 
of important mutations can be used to pri- 
oritize hits identified by our study for func- 
tional follow-up. 

Some lineages increased in fitness more 
than others over the course of the pandemic 
(fig. S4). Notably, B.1.1 displayed the greatest 
variability among sublineages, followed by B.1. 
Fitness appeared to reach a plateau over time 
for most lineages (Fig. 1 and fig. $4). In con- 
trast to Omicron sublineages Alpha and Delta 
showed little variability in Spike-attributable 
fitness (fig. S25), suggesting that the propen- 
sity to acquire new Spike mutations depends 
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Fig. 2. Summary of amino acid changes assessed in this study. (A) Manhattan plots of amino acids and changes across the entire genome. (B) Changes in the first 
850 amino acids of S. (©) Changes in the first 250 amino acids of N. The y axis in (A) to (C) shows effect size A log R, the estimated change in log relative fitness resulting 
from each amino acid change. The bottom three axes show the background density of all observed amino acid changes, the density of those associated with growth 
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fig 2 continued 


(weighted by |A log R|), and the ratio of the two. The top 55 amino acid changes are labeled. See fig. S21 for detailed views of S, N, ORFla, and ORF1b. 
(D) Structure of the spike-ACE2 complex (PDB: 7KNB). Spike subunits are colored light purple, light orange, and gray, top-ranked mutations are shown as 
red spheres, and ACE2 is shown in magenta. (E) Close-up view of the RBD interface. (F) Top-ranked mutations in the N-terminal RNA-binding domain of 

N. Residues 44 to 180 of N (PDB: 7ACT) are shown in light purple. Amino acid positions corresponding to top mutations in this region are shown as red spheres. 


A 10-nucleotide (nt)- bound RNA is shown in gray. 
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Fig. 3. Characterization of mutation-specific effects. (A) Infectivity relative to 
wild type (WT) of lentiviral vectors pseudotyped with the indicated Spike mutants. 
Target cells were HEK293T cells expressing ACE2 and TMPRSS2 transgenes. The 
genetic background of the Spike was Wuhan-Hu-1, bearing D614G. Red bars were 
significantly different from WT (adjusted p values shown) and black bars were not 
significantly different from WT. (B) For the 1701 SARS-CoV-2 clusters with at 
least one amino acid substitution in the RBD domain we compare: (i) the PyRo 
prediction for the contribution with A log R from RBD substitutions only; with (ii) 
antibody binding computed with the antibody-escape calculator in (20). The 


escape calculator is based on an intuitive nonlinear model parameterized with deep 
mutational scanning data for 33 neutralizing antibodies elicited by SARS-CoV-2. 
PyRg predictions exhibit high (Spearman) correlation with predictions from 
Greaney et al. (20) (C to E) We dissect PyRo A log R estimates into the S-gene (C), 
RBD (D), and non-S-gene (E) contributions for 3000 SARS-CoV-2 clusters (blue 
dots). The horizontal axis corresponds to the date at which each cluster first 
emerged. Red squares denote the median A log R within each monthly bin. The 
increased importance of S-gene mutations (notably in the RBD) over non-S-gene 
mutations starting around November 2021 is apparent. 


on the constellation of mutations that consti- 
tute a lineage, consistent with epistasis. A limi- 
tation of PyRy is that it does not incorporate 
epistatic interactions between mutations (sup- 
plemental note 1); however, our results dem- 
onstrate the feasibility of inferring genetic 
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determinants and lineage fitness with the 
simplest possible linear-additive model and 
provide a foundation for future research for 
more complex modeling that includes epi- 
static effects between mutations and migra- 
tion across geographic regions. 
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In summary, PyRg provides a genome-wide, 
automated approach for detecting viral line- 
ages with increased fitness. By combining a 
model-based assessment of lineage fitness with 
absolute case counts, our model provides a 
global picture of the events of the first two 
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years of the pandemic. Because it assesses the 
contribution of individual mutations and ag- 
gregates across all lineages and geographic 
regions, it can identify mutations and gene re- 
gions that likely increase fitness; additionally, 
mutation-level information may help detect 
fitter lineages earlier than case counts alone. 
Applied to the full set of publicly available 
SARS-CoV-2 genomes, it provides a genomic 
view of the mutations driving increased fitness 
of the virus, identifying experimentally estab- 
lished driver mutations in S and highlighting 
the key role of non-S mutations particularly in 
N, ORFIb, and ORFla, which have received 
relatively less research attention than muta- 
tions in S. By modeling millions of viral se- 
quences across thousands of regions, PyRo 
yields mechanistic insight into viral fitness 
and offers a panoramic view of viral evolution, 
revealing a pattern whereby major circulating 
lineages fragment into sublineages with mod- 
est differences in fitness before they are col- 
lectively displaced by the sudden emergence of 
markedly fitter variants. 
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Glacial ice supports a distinct and undocumented 
polar bear subpopulation persisting in late 
2ist-century sea-ice conditions 

Kristin L. Laidre’?*, Megan A. Supple’, Erik W. Born’, Eric V. Regehr’, @ystein Wiig‘, 
Fernando Ugarte?, Jon Aars®, Rune Dietz®, Christian Sonne®, Peter Hegelund?, Carl Isaksen2, 


Geir B. Akse+, Benjamin Cohen’, Harry L. Stern’, Twila Moon’, Christopher Vollmers®, 
Russ Corbett-Detig®, David Paetkau®, Beth Shapiro>”° 


Polar bears are susceptible to climate warming because of their dependence on sea ice, which is 
declining rapidly. We present the first evidence for a genetically distinct and functionally isolated group 
of polar bears in Southeast Greenland. These bears occupy sea-ice conditions resembling those 
projected for the High Arctic in the late 21st century, with an annual ice-free period that is >100 days 
longer than the estimated fasting threshold for the species. Whereas polar bears in most of the 

Arctic depend on annual sea ice to catch seals, Southeast Greenland bears have a year-round hunting 
platform in the form of freshwater glacial mélange. This suggests that marine-terminating glaciers, 
although of limited availability, may serve as previously unrecognized climate refugia. Conservation of 
Southeast Greenland polar bears, which meet criteria for recognition as the world’s 20th polar bear 
subpopulation, is necessary to preserve the genetic diversity and evolutionary potential of the species. 


he impact of human-mediated climate 

warming on biodiversity is a global con- 

servation concern (1). Shifts in species 

phenology, distribution, and abundance 

are altering ecological communities and 
increasing extinction risk (2). Understanding 
how life history strategies affect species per- 
sistence, including the role of within-species 
diversity at multiple spatial scales, is critical 
for conservation (3, 4). 

The ongoing and predicted decrease in Arctic 
sea ice (7) raises concerns for ice-dependent 
species such as polar bears (5), whose survival 
will depend on establishing populations in 
fragmented habitats and maintaining genetic 
connectivity among them (6). Recent forecasts 
of reduced polar bear distribution and abun- 
dance (7) are based on broad-scale climate 
projections and the estimated number of days 
that bears can fast without sea ice, which they 
require for hunting. These forecasts do not 
consider the role of small-scale habitat features 
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that, although of limited availability, could 
serve as climatic refugia. Here, we combine 
36 years of movement, genetic, and demo- 
graphic data to describe a genetically distinct 
and functionally isolated group of polar bears 
in Southeast Greenland. We use these data to 
explore the prospects for polar bear survival 
in a warmer, ice-free Arctic. 

The East Greenland subpopulation, one of 
19 recognized subpopulations of polar bears, 
occupies a marine area of ~700,000 km? along 
a mostly uninhabited ~3200-km coastline 
(8, 9). In 2011, we initiated a multiyear project 
to assess the status of East Greenland bears 
using movements, genetics, and demography 
(Fig. 1 and figs. S1 and $2). Intriguingly, satel- 
lite telemetry revealed that bears living south 
of 64°N, hereafter Southeast Greenland (n = 
27 adult females, 2015-2021; table S1), did not 
interact with bears living north of 64°N, here- 
after Northeast Greenland (nm = 56 adult 
females, 1993-2021; Fig. 2A). Instead, female 
bears in Southeast Greenland made localized 
movements in fjords and at marine-terminating 
glacier fronts (Fig. 2B) with a median 4-day 
movement rate of 10 km (8.4 to 13.1 km, 25 and 
75% quartiles; fig. S3). Some bears remained 
in one fjord for years, while others moved be- 
tween adjacent fjords, traveling over mountains 
via the ice sheet and peripheral glaciers (fig. 
S4). In contrast, the median 4-day movement 
rate of female bears in Northeast Greenland 
was 40 km (30.0 to 49.0 km; fig. $3), with an- 
nual movements of >1500 km on the offshore 
sea ice. Recaptures of males indicated similar 
movement patterns (10). The degree of geo- 
graphic isolation of Southeast Greenland 
bears distinguishes them from other sub- 
populations (1). 
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Current sea-ice conditions in Southeast 
Greenland, like those projected for the late 
21st century in the High Arctic, appear un- 
suitable to support polar bears. Pack ice (freely 
floating sea ice) is intermittently flushed into 
fjords but is largely unusable owing to its low 
concentration and transience (Fig. 2C and figs. 
S5 to S10). Southeast Greenland bears use fast 
ice (sea ice frozen to the coastline) when it is 
available in winter and spring (figs. S11 to S15), 
but the number of days per year with fast-ice 
coverage is low and variable (between 0 and 
153 days; mean: 89 days) (fig. S15 and table 
$2). Fast ice typically forms in February and 
disappears by late May, so the region is sea 
ice-free for more than 250 days per year, ex- 
ceeding polar bears’ estimated seasonal fast- 
ing threshold of 100 to 180 days (7). Our data 
show that Southeast Greenland bears use gla- 
cial mélange (freshwater ice at glacier fronts) 
as a platform for hunting during the sea ice- 
free season (Fig. 2, D and E), whereas in most 
parts of the Arctic, polar bears must move onto 
land or track the receding sea ice northward 
into the less productive polar basin (77). 

Southeast Greenland bears appear to have 
adapted their movements to the region’s spe- 
cific physical geography. The high-velocity 
East Greenland Coastal Current (72) season- 
ally brings a narrow band of low-concentration 
pack ice south of 64°N and around the southern 
tip of Greenland (figs. S8 to S10) (73). All tracked 
Southeast Greenland bears that moved out 
of the fjords (m = 11) became caught in this 
current’s drift ice and were transported south- 
ward toward Cape Farewell, drifting an aver- 
age of 189 km in <2 weeks (fig. S4). Notably, all 
swam ashore and walked via land to their home 
fjord within 1 to 2 months, demonstrating high 
site fidelity. Bears in Southeast Greenland must 
remain inside fjords or risk export to human- 
inhabited areas of South Greenland or into the 
North Atlantic. 

Southeast Greenland bears are the most 
genetically isolated polar bears in the Arctic 
(figs. S16 to $25 and tables S3 to S8). Principal 
components analyses showed a distinct South- 
east Greenland cluster across 20 microsatellite 
loci isolated from 372 bears (Fig. 3A, fig. S16, 
and table S3), 40 whole genomes (Fig. 3B, 
fig. S21, and table S7), and whole-blood tran- 
scriptomic data from 16 bears (Fig. 3C and 
table S8) (70). In a Bayesian clustering analysis 
of 3064 bears from all subpopulations except 
the Arctic Basin, Southeast Greenland bears 
were distinct from all subpopulations across 
the species’ circumpolar range when the model 
assumed four clusters (Fig. 3D and figs. S18 
to S20) (10). Norwegian Bay bears, previously 
identified as the most distinct subpopulation 
(14-16), emerged at five clusters. Southeast 
Greenland bears also have the lowest levels of 
heterozygosity across microsatellites (table S4) 
and whole genomes (fig. $25), as well as the 
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of whole-genome and RNA samples (2015-2017) and 


highest proportion of their genomes in runs of 
homozygosity (fig. S25) compared with other 
polar bear subpopulations. 

Estimates of genetic divergence measured 
as Fgr showed that Southeast Greenland bears 
are more genetically diverged from neighbor- 
ing groups of bears [in Northeast Greenland 
(Fgr = 0.059) and the Davis Strait subpopu- 
lation (Fs; = 0.086)] than any other neighbor- 
ing subpopulation pair (table S5). The pair 
with the next-highest Fg; (M’Clintock Chan- 
nel and Foxe Basin, Fs; = 0.039) are separated 
by a land boundary, which is less conducive 
to polar bear movement than a water-ice 
boundary. Norwegian Bay has even lower Fgr 
values with its neighbors (Kane Basin, Fg; = 
0.026; Lancaster Sound, Fs; = 0.027). The ge- 
netic distinctiveness of Southeast Greenland 
bears may be due in part to the rapid directional 
flow of the East Greenland Coastal Current, 
which limits both northward emigration and 
southward immigration. In support of this 
hypothesis, bears genotyped from South and 
Southwest Greenland did not cluster with 
Southeast Greenland, their nearest neigh- 
bor. Instead, they clustered with Northeast 
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Greenland bears (Figs. 1 and 3A), suggesting 
that they originated in Northeast Greenland 
and were subsequently transported south on 
the sea ice, past Southeast Greenland’s fjords 
and around Cape Farewell. Davis Strait bears 
also were more similar to Northeast Greenland 
bears than to Southeast Greenland bears (table 
85), suggesting gene flow from Davis Strait 
around the northern coast of Greenland. 
Demographic data are consistent with South- 
east Greenland bears functioning as a distinct 
subpopulation. Birth rates were low compared 
with Northeast Greenland and most other 
polar bear subpopulations (0, 17). We found 
no differences in litter sizes between Southeast 
Greenland and Northeast Greenland subpop- 
ulations, which suggests similar cub survival 
(tables S9 and S10). Adult female body mass, 
an important determinant of cub production 
and survival (78), was lower in Southeast 
Greenland than in several other subpopulations 
but was similar to that of Northeast Greenland 
bears and the Barents Sea subpopulation (fig. 
$26 and table S11), both of which use glacial 
fjords (19). It is possible that low birth rates are 
influenced by highly fractured fjord and moun- 
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tain habitats that reduce bear movements and 
impede breeding pair formation (20). 

The distinctive movement, genetic, and dem- 
ographic patterns of Southeast Greenland 
bears may have evolved over several hundred 
years of isolation. The earliest reference to 
polar bears in South Greenland dates to the 
1300s (20), and the first written Western record 
of bears in Southeast Greenland fjords is from 
the 1830s (22). These records are consistent 
with analyses of rare single-nucleotide poly- 
morphisms [maximum allele frequency of 
5%; (J0)] in the whole-genome data, which 
suggested that sampled Southeast Greenland 
bears share a common ancestor ~200 years ago 
(95% confidence interval: 189 to 264 years ago; 
fig. S22) and thus have been genetically isolated 
for at least this long. 

Southeast Greenland bears have few op- 
portunities for dispersal. Their habitat com- 
prises steep coastal topography with fjords 
separated by >2000-m mountains and narrow 
glaciers. To the west, dispersal is restricted by 
the Greenland Ice Sheet. To the east is the open 
water of the Denmark Strait. Northward move- 
ment is limited by the East Greenland Coastal 
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Fig. 2. Polar bear movements and habitat types. (A) Area use for Northeast 
Greenland (NEG) and Southeast Greenland (SEG) polar bears based on 
satellite telemetry from 1993 to 2021. SEG bears range over ~400 linear km 
of mountainous and glaciated coastal habitat (~27,749 km? within 95% 
probability contours), whereas NEG bears range over >2000 linear km of pack 


Current (12), by katabatic winds that flush the 
region of sea ice around Kage Bay (65°N), and 
by subsistence hunting around the commu- 
nity of Tasiilaq. To the south, polar bear habitat 
disappears at the southern tip of Greenland, 
and both South and Southwest Greenland are 
inhabited by humans who hunt polar bears. 
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Despite limited options for dispersal, South- 
east Greenland bears occasionally capture ge- 
netic diversity from other subpopulations, and 
our observations suggest that immigrants can 
adapt to the distinctive environment. Research- 
marked bears have been observed in Southeast 
and Southwest Greenland after being tagged 
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ice and fast ice and 671,208 km? (95% probability contours). NEG and SEG are 
separated at about 64°N latitude. (B) Fjord fast-ice habitat in SEG, present 

<4 months per year. (©) Low-concentration drift ice with open water on the 
horizon in Denmark Strait. (D) SEG polar bear using glacial ice at 62°N in 
September 2016. (E) Polar bear with a fresh seal kill at 61°N in September 2016. 


in Franz Josef Land (Russia), Svalbard, Baffin 
Island, and the eastern Beaufort Sea (9). Our 
whole-genome dataset identifies two individ- 
uals as potential immigrants: one that clusters 
closer to the Southern Beaufort Sea and Chukchi 
Sea subpopulations in Alaska, and another that 
clusters closer to Northeast Greenland (Fig. 3B). 
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Fig. 3. Population structure. (A) Primary axis of genetic principal components 
analysis (PCA) versus latitude of Greenland bears. Most SEG bears (red) 
cluster separately from NEG bears (blue). Bears from South and Southwest 
Greenland (black) cluster with NEG bears. (B) Genomic PCA showing that SEG 
bears cluster separately from other subpopulations, except individuals D24051 
and D24087, which cluster with Chukchi Sea (CS)/Southern Beaufort (SB) 

and NEG bears, respectively. (©) Transcriptomic PCA of East Greenland samples. 
SEG (red) and NEG (blue) bears show different gene expression profiles. Axes 


We detected little evidence of admixture in 
either bear (Fig. 4 and fig. S23). Combined with 
ancestry analysis (fig. S22), this suggests that 
these individuals were recent immigrants 
into Southeast Greenland. While these data 
do not establish successful gene flow into 
Southeast Greenland, microsatellites identify 


Laidre et al., Science 376, 1333-1338 (2022) 


SB 


NW 


NB 


some Southeast Greenland bears with mixed 
ancestry (Fig. 3A and figs. S18 to S20), suggest- 
ing that immigrant bears adapt sufficiently to 
interbreed with resident bears. 

Although our study was not designed to esti- 
mate abundance, data from marked individuals 
suggest that a few hundred animals currently 
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inhabit Southeast Greenland, similar to other 
small polar bear subpopulations (10, 17). Human- 
caused mortality is unlikely to limit the num- 
bers of Southeast Greenland bears. Human 
settlements in the Skjoldungen-Timmiarmiit 
area were abandoned in the mid-1960s. Small 
numbers of subsistence hunters lived along 
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the coast until the late 1970s (23), but it is 
uncommon for polar bear hunters to go to 
Southeast Greenland today (24). 

We propose that Southeast Greenland polar 
bears are a resident, functionally and demo- 
graphically isolated group that warrants con- 
sideration as the 20th subpopulation of polar 
bears by the International Union for Conser- 
vation of Nature. Southeast Greenland bears 
are distinct according to the same lines of 
evidence used to delineate other subpopula- 
tions, most of which exhibit substantially 
more genetic mixing (J/, 16). Maintenance of 
the distinctive Southeast Greenland lineage is 
necessary to preserve the genetic diversity of 
the species. 

The existence of the Southeast Greenland 
subpopulation has implications for conser- 
vation. In most of the Arctic, glacial mélange 
habitats are uncommon, and the future of 
polar bears will be closely tied to the availa- 


y from SEG as compared with the subpopulations 
| combinations of individuals from the specified 
ual is admixed. The region of statistical significance 


and Svalbard, glacial mélange habitat is avail- 
able and occupied year-round by ringed seals, 
the polar bear’s primary prey (25) (Fig. 2, D 
and E). We show that some polar bears can 
become established and adapt to this distinc- 
tive environment, which, in some areas, may 
provide a buffer to sea-ice loss. The long-term 
role of refugia in the persistence of polar bears 
is unknown (26). Even glacial mélange habitat 
is changing, because the Greenland Ice Sheet 
coastal margin is undergoing widespread re- 
configuration (27). Continued monitoring of 
Southeast Greenland bears will be critical to 
understand how the species will be affected 
by, and respond to, anthropogenic climate 
warming. 
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Structure of the mammalian ribosome as it decodes 
the selenocysteine UGA codon 


Tarek Hilal’*+, Benjamin Y. Killam, Milica Grozdanovi¢?+, Malgorzata Dobosz-Bartoszek§, 
Justus Loerke’, Jorg Biirger’?, Thorsten Mielke®, Paul R. Copeland‘, 


Miljan Simonovié?*{, Christian M. T. Spahn’* 


The elongation of eukaryotic selenoproteins relies on a poorly understood process of interpreting 
in-frame UGA stop codons as selenocysteine (Sec). We used cryo-electron microscopy to visualize 
Sec UGA recoding in mammals. A complex between the noncoding Sec-insertion sequence (SECIS), 
SECIS-binding protein 2 (SBP2), and 40S ribosomal subunit enables Sec-specific elongation factor 
eEFSec to deliver Sec. eEFSec and SBP2 do not interact directly but rather deploy their carboxyl-terminal 
domains to engage with the opposite ends of the SECIS. By using its Lys-rich and carboxyl-terminal 
segments, the ribosomal protein eS31 simultaneously interacts with Sec-specific transfer RNA 
(tRNAS°°) and SBP2, which further stabilizes the assembly. eEFSec is indiscriminate toward L-serine 
and facilitates its misincorporation at Sec UGA codons. Our results support a fundamentally distinct 
mechanism of Sec UGA recoding in eukaryotes from that in bacteria. 


ranslating ribosomes pause at stop 
codons—UAA, UAG, and UGA—which 
allows a protein release factor (RF) to 
bind and terminate protein synthesis. 
In a subset of mRNAs from most or- 
ganisms across all domains of life, in-frame 
UGA codons recruit selenocysteinyl-tRNA 
(Sec-tRNA®) to facilitate the stop-to-Sec re- 
coding. The recoding leads to the synthesis of 
selenoproteins, which are required for a 
myriad of functions, notably the mainte- 
nance of redox and thyroid hormone homeo- 
stasis and protection of the cell membrane 
and DNA from oxidative damage (7). An em- 
bryonically lethal phenotype of the mouse 
mutant in which tRNAS” was deleted (2) and 
systematic analyses of genetic rodent models 
of selenoprotein deficiency (3, #4) demonstrate 
that selenoproteins are essential for vertebrate 
survival. Selenoprotein deficiency and muta- 
tions in selenoproteins cause systemic, often 
lethal diseases in humans (5). Despite the 
biological importance, the mechanism of 
Sec UGA recoding in eukaryotes is not well 
understood. 
Stop-to-Sec recoding relies on a Sec-insertion 
sequence (SECIS) in the selenoprotein mRNA 
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and a Sec-specific elongation factor—SelB 
in prokaryotes and eEFSec in eukaryotes. 
Although the bacterial SECIS follows Sec UGA 
within the open reading frame, the phyloge- 
netically unrelated eukaryotic SECIS is in the 
3'-untranslated region (3’-UTR). Prokaryotic 
SelB facilitates recoding on its own, but eEFSec 
requires a eukaryote-specific protein factor, 
SECIS-binding protein 2 (SBP2). Despite some 
conservation, the prokaryotic mechanism (6) 
cannot be extrapolated to eukaryotes (7). This 
raises questions about the architecture of 
the eukaryotic UGA recoding assembly (the 
“selenosome”), the role of SECIS and SBP2, 
and the molecular choreography that governs 
discrete steps of the process. To address these 
questions, we reconstituted human eEFSec, 
SBP2, and Ser-tRNA®® on mammalian 80S 
ribosomes that were programmed with an 
mRNA containing an authentic SECIS element. 
Using cryo-electron microscopy (cryo-EM), we 
visualized the early steps of Sec UGA recoding 
in higher eukaryotes, providing a structural 
basis for the distinct mechanism of seleno- 
protein elongation. 

To produce a stable complex and position 
the UGA codon in the ribosomal decoding 
center (DC), we used a chimeric mRNA con- 
struct that harbored a cricket paralysis virus 
(CrPV) internal ribosome entry site (TIRES) on 
its 5’ end (8). Sec UGA, the first coding triplet, 
is followed by ~900 nucleotides (nt) of firefly 
luciferase coding region and the 3’-UTR of rat 
glutathione peroxidase 4 (Gpx4) that contains 
the SECIS. We combined the chimeric mRNA, 
purified rabbit ribosomal subunits, and the 
functional C-terminal half of human SBP2 
(residues 409-854), which is composed of a 
Sec-insertion domain (SID) and an RNA- 
binding domain (RBD) (Fig. 1A). Concur- 
rently, we assembled a ternary complex of the 
guanosine triphosphatase (GTPase)-defective 
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His®°—-Ala mutant of eEFSec (eEFSec-H96A), 
Ser-tRNA*, and guanosine 5’-triphosphate 
(GTP). We chose eEFSec-H96A to avoid com- 
plex disassembly from GTP hydrolysis and 
Ser-tRNAS“ because of its similarity with Sec- 
tRNA‘, In the end, programmed ribosomes 
and ternary complex were assembled into the 
recoding complex immediately before vitrifi- 
cation (Fig. 1A). 

From 13,921 selected micrographs, a total of 
1,685,923 particle images were extracted and 
analyzed by means of multiparticle refinement 
(figs. SI to S3 and table S1). A subpopulation of 
77,142 particle images that corresponded to 
the intact selenosome yielded a reconstruction 
of 2.8 A resolution (Fig. 1 and figs. S1 and $2). 
All recoding factors form an extended network 
of interactions (Fig. 1, B to E). The density for 
SECIS, a member of the kink-turn family of 
RNA structural motifs, is well resolved and is 
adjacent to the beak domain of the 40S. Near 
the SECIS core, densities for a segment of the 
SID and the entire RBD of SBP2 are readily 
visible (Figs. 1E and 2A). The opposite end of 
the SECIS contacts the C-terminal domain 4 
(D4) of eEFSec, whereas D1, D2, and D3 reside 
at the GTPase-associated center (GAC). The ac- 
ceptor and variable arms of Ser-tRNA*®® are 
bound to eEFSec (Fig. 2A), and the anticodon 
loop is properly positioned in the A site of 
the DC, which suggests that we captured the 
preaccommodated state of the selenosome with 
the tRNA‘ in the A/T conformation (fig. $4). 
The long mRNA segment that connects UGA 
and the SECIS is partially disordered and could 
not be modeled. The CrPV IRES is in the trans- 
located state (8), which points away from the 
DC and rests against ES30L of the L1 stalk in 
the 60S (Fig. 1, C and D). We did not observe 
any interactions between the CrPV IRES and 
eEFSec, SBP2, Ser-tRNA*“, or the SECIS ele- 
ment. The absence of such interactions and 
the general similarity of our visualized recod- 
ing complex to canonical mammalian decod- 
ing complexes (9, 10) suggest its physiological 
relevance. 

Our reconstruction revealed the structure 
of the eEFSeceGTP+Ser-tRNA* complex (Figs. 
1E and 2A). eEFSec-H96A resembles crystal 
structures of wild-type (WT) eEFSec (7) and 
archaeal SelB (72) (fig. S5A). When compared 
with the bacterial SelB, the structural conser- 
vation is preserved in D1, D2, and D3 but ab- 
sent in D4 (fig. S5B). GTP is bound to the 
GTPase pocket in D1, but the side chains of 
switch 2 are disordered, which confirms the 
catalytically incompetent conformation of 
eEFSec-H96A (fig. S6A). D1 and D2 are sand- 
wiched between H95 and uL14 of the 60S and 
hd and hi4 of the 40S (Fig. 2B), whereas D4 is 
wedged between the apical loop of SECIS and 
h33 of the 40S (Fig. 2D). The CCA end of 
tRNA*© positions the Ser group into the pro- 
posed Sec-binding pocket. Highly conserved 
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Ser, Gin?”, and His” surround Ala” (fig. 
S7), and H-bonds with Gln” lock its nucle- 
obase in place (Fig. 2C). Phe?”? and Arg?®° 
cap the Sec-binding pocket, with Arg?8? stabi- 
lizing the pocket through interactions with 
Thr”. This explains the loss of function in 
Are”>_,Ala (R285A) and Arg”*°—Asn (R285N) 
mutants (12). The hydroxyl of Thr™*” is ~3.7 A 
from the hydroxyl of Ser on Ser-tRNA*“ (Fig. 
2C), which suggests its importance for amino- 
acid selection. When accounting for the longer 
C-Se bond, the Se atom of Sec would be at the 
optimal distance of ~2.6 to 3.2 A from Thr”. 
Last, the eukaryote-specific loop B24-B25 
(residues 522-524) in D4 of eEFSec forms 
an interface with the backbone of the AAR 
motif of SECIS (residues 1128-1130) and h33 
(residues 1305-1308) of the 40S subunit (Fig. 2, 
D and E). Using a well-established Sec UGA 
readthrough reporter assay (Fig. 2F) (73), we 
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show that eEFSec promotes recoding in an 
SBP2- and SECIS-dependent manner in the 
presence of Ser-tRNA*® but not Ser-tRNAS*™ 
(Fig. 2G). Also, replacing Thr**? with either Val 
(T242V) or Leu (T242L) and Phe®”” and Gin“ 
with Gly (F522G and Q524G) causes impair- 
ment of the readthrough activity (Fig. 2H). 
Taken together, our results argue that Thr?*” 
is a key selectivity residue in human eEFSec. 
Also, just as Cys is misincorporated at Sec 
UGA in TXNRD1 under low Se levels (14), 
the same may occur with Ser. However, it 
remains to be seen whether Ser misinsertion 
occurs in vivo and whether selenoenzyme ac- 
tivity is affected. 

Human eEFSec uses all its domains to en- 
gage the tRNA‘ (fig. S6, center). D1, D2, 
and D3 bind to the acceptor-TVC arm (fig. S6, 
B and C), whereas the D3-D4 linker and D4 
contact the variable stem and loop (fig. S6, 
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vantage points of the 40S (left) and 60S (right) subunits. (E) 3D map 
covering the eEFSec*GTP+Ser-tRNAS°°*SECIS*SBP2 complex (cartoon). 
60S is gray, 40S is sand, eEFSec*GTP is dark red, Ser-tRNAS°° is 

dark green, SECIS is blue, SBP2 is dark orange, and the mRNA harboring 


D and E). The side chain of Glu??? estab- 
lishes the tRNA*“ identity through H-bonds 
with the Watson-Crick face of the Gly” re- 
cognition base (fig. S6B). Arg*”” and Asp*** 
from loop 819-820 of D3 contact the minor 
groove of the TVC arm (fig. S6C). The D3-D4 
linker runs parallel to the variable arm where 
Lys*” interacts with a nonbridging oxygen be- 
tween G47a and U47b (fig. S6D), and the var- 
iable loop is lodged against loop B22-a12 
(residues 495-499) of D4 (fig. S6E). These 
interactions explain why mutations in and 
deletion of D4 had detrimental effects on 
eEFSec activity and selenoprotein synthesis 
(15). Although they share a conserved bio- 
logical role, the eukaryotic selenosome is dis- 
tinct from the bacterial one (fig. S8A), which 
suggests divergent UGA recoding mechanisms. 
In particular, the C-terminal D4 of eEFSec 
interacts with the variable arm of tRNA*” and 
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Fig. 2. Human eEFSec*GTP-Ser-tRNA®® on the 80S ribosome. (A) Side view 
of the eEFSec complex (cartoon) as bound to the 80S. (B) Dl and D2 of eEFSec 
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H-bonds. (D) D4 of eEFSec is between the AAR motif of SECIS and h33 of the 40S. 


The view is rotated 90° clockwise around the vertical axis relative to (A). (E) Close- 
up view of the interface between AAR motif, h33, and D4. (F) The Sec UGA 
readthrough assay based on the luciferase reporter. (G) Ser-tRNA°®, but not 
Ser-tRNA°*, supports the SBP2-SECIS-dependent Sec UGA readthrough. Activity 
levels are presented as fold change over the control sample. White and gray bars 
correspond to WT and AUCC mutant SECIS, respectively. (H) T242L, T242V, 

and F522G/Q524G mutants lost the readthrough activity. 


points away from the mRNA channel (fig. $8, 
A and B), whereas the bacterial D4 is rotated 
~90° around the linker, does not interact with 
the variable arm, and binds near the mRNA 
entry channel (fig. S8C) (6). 

The mammalian SECIS, which is derived 
from the 3'-UTR of Gpx4 (Fig. 3A), adopts a 
Form II structure that is characterized by 
two nearly coaxial stems that connect the 
GA quartet and AUGA bulge on one end and 
two loops and the essential AAR motif on the 
opposite, apical end (Fig. 2G). The basal stem, 
or helix I, is disordered in our map. The AUGA 
bulge, or the SECIS core, folds into a kink-turn 
motif, which serves as the SBP2-binding site 
(Fig. 3, A and B). Sitting atop Leu’, U1112 
forms H-bonds with the invariant Arg”! of 
SBP2, which suggests its relevance for complex 
formation (Fig. 3C). The rest of the bulge is 
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structurally important, as illustrated by the 
inability of the AUGA—AUCC mutant to sup- 
port recoding (Fig. 2G). On the opposite end are 
the apical stem, the apical loop, and the AAR 
motif, which is characterized by three unpaired 
adenosines (Fig. 3A). Replacing unpaired AAA 
with AUG hinders recoding (16), but the ab- 
sence of sequence-specific interactions with the 
AAR motif may rationalize why some SECIS 
elements carry the CCR motif instead (/7). 
Nonetheless, our structure uncovered insights 
about SECIS, a noncoding RNA element that 
regulates selenoprotein synthesis in higher 
organisms. 

We modeled the N-terminal segment of 
the SID (residues 429-475) and RBD (residues 
625-780) of SBP2 (Fig. 3B). Residues 429-437 
fold into a fishhook-like structure that anchors 
against h33 of the 18S ribosomal RNA (rRNA). 
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The subsequent segment (residues 441-446) 
forms a parallel B-strand that leans against the 
C terminus of eS31, which effectively expands 
the B sheet of the zinc-finger motif of this 
ribosomal protein (Fig. 3B). With its Lys-rich 
motif, eS31 reaches the anticodon arm of 
tRNA®*® (fig. S9), but the 84 N-terminal resi- 
dues are disordered, which allows expan- 
sion of the A site and accommodation of an 
enlarged tRNA*® variable arm. Residues 
448-455 of the SID run along the major groove 
of SECIS (Fig. 3B) without establishing sequence- 
specific contacts. The SID in our structure ends 
with an a helix (residues 456-475), which is 
almost perpendicular to helix II of SECIS and 
is near the B1-o3 loop (“IQSKG”) of the 
RBD (Fig. 3B). The ribosomal protein uS19 
binds to the opposite side of helix II, which 
further stabilizes the complex (Fig. 3B). Using 
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luciferase reporter assay (Fig. 2F), we show 
that SBP2 that lacks residues 403-428 (SBP2- 
A428), but not 403-476 (SBP2-A476), supports 
Sec UGA readthrough (Fig. 3E). This suggests 
that residues 429-476 of the SID are im- 
portant for binding of SBP2 to the ribosome 
and explains why mutations in the same region 
of rat SBP2 substantially diminished Sec incor- 
poration efficiency (18, 19). Furthermore, RBD 
of SBP2 adopts an L7Ae protein fold and binds 
to the conserved kink-turn motif of the SECIS. 
The RBD is locked in place through interac- 
tions of conserved **RFQDR®®, *’DPVKA*™, 
and °°VLKHLKL®** motifs (fig. S10) with 
ribosomal protein uS19 and h41 of the 40S 
(Fig. 3B). This illuminates why mutations 
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in these motifs hindered ribosome binding 
and Sec incorporation activity, but not SECIS 
binding (79). Last, we rationalize effects of 
the disease-associated missense mutations 
Glu°”’—+Asp (E679D) and Cys®'—Arg (C691R) 
(fig. S11A) (20, 27). Although the effects of 
E679D are neutral, C691R probably compro- 
mises SBP2 structure because of steric clashes 
of Arg" with Ile’, Pro™, Val?%, Ne’, Phe”, 
and Met” (fig. S11, B and C). 

We investigated whether binding to the 80S 
of SBP2°SECIS alone could poise the ribosome 
for Sec UGA recoding. To this end, we deter- 
mined the structure of the 80S*SBP2°SECIS 
complex at 3.1 A resolution (Fig. 4A and fig. $3). 
We found that SBP2 and SECIS are bound to 
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core (asterisk). The SID interacts with the zinc finger of eS31 (arrow). 
Contacts between helix Il of the SECIS and uS19 and eS31 stabilize the complex. 
(C) Close-up view of interactions between Arg’*" of RBD and U1112 of SECIS. 
(D) N-terminal segment of SID (residues 442-446) forms a parallel B-strand to 
the C terminus of eS31. (E) Only deletion of residues 403-476 from SBP2 
(A403-476) abolishes Sec UGA readthrough activity. 


the beak of the 40S in the same manner as in 
the preaccommodated state structure of the 
complete selenosome (Fig. 4A). The mRNA 
follows a similar trajectory, which suggests 
that its conformation is independent from 
the eEFSec ternary complex binding and the 
recoding step itself. We thus conclude that the 
SBP2¢ SECIS binding to 80S is a prerequisite 
for the eEFSeceGTP+Sec-tRNA*” ternary com- 
plex anchoring, which is consistent with the 
observation that the eukaryotic SECIS pro- 
motes Sec incorporation in cis when placed 
>55 nt downstream of the Sec codon (22). 
Last, SBP2°SECIS is not found in a position 
to prevent translation termination, which 
explains why RFs terminate selenoprotein 
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synthesis at UGA codons when Sec insertion 
fails (23, 24). 

Although we cannot rule out that IRES and 
Ser-tRNA*® may have imposed kinetic and 
conformational constraints on programmed 
ribosomes, our structures allowed visualization 
of the long-sought early steps of selenoprotein 
elongation at Sec UGA codons in eukaryotes, 
leading to a revised model of the process 
(Fig. 4B). During translation of selenoprotein 
mRNAs, the SBP2eSECIS complex binds to 
the head domain of the 40S subunit. The RBD 
of SBP2 binds to the SECIS base, whereas 
the N-terminal part of the SID latches onto 
the 40S through contacts with the rRNA and 
eS31. The prebound SBP2eSECIS forms a 
docking site on the stalled ribosome for 
eEFSeceGTPeSec-tRNA* (Fig. 4B). The EF- 
Tu-like domain of eEFSec binds to the GAC, 
and D4 binds to the apical loop of the SECIS, 
opposite from the RBD. eEFSec clasps the 
A/T conformation of Sec-tRNAS® with the 
variable arm contacting the Lys-rich domain 
of eS31, which completes the preaccommo- 
dated state of the selenosome. Subsequent 
events remain unclear, which warrants de- 
tailed structural analyses. After adopting 
the GTPase activated state, the CCA end of 
Sec-tRNA*“ is liberated from eEFSec. eEFSec 
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Meteorin-like promotes heart repair through 
endothelial KIT receptor tyrosine kinase 


Marc R. Reboll"”, Stefanie Klede“?, Manuel H. Taft®, Chen-Leng Cai’, Loren J. Field®, Kory J. Lavine®, 
Andrew L. Koenig®, Jenni Fleischauer’, Johann Meyer’, Axel Schambach’, Hans W. Niessen®, 
Maike Kosanke®, Joop van den Heuvel’°, Andreas Pich”, Johann Bauersachs“, Xuekun Wu?, 
Linqun Zheng, Yong Wang", Mortimer Korf-Klingebiel*, Felix Polten’, Kai C. Wollert)2* 


Effective tissue repair after myocardial infarction entails a vigorous angiogenic response, guided by 
incompletely defined immune cell—endothelial cell interactions. We identify the monocyte- and 
macrophage-derived cytokine METRNL (meteorin-like) as a driver of postinfarction angiogenesis and 
high-affinity ligand for the stem cell factor receptor KIT (KIT receptor tyrosine kinase). METRNL 
mediated angiogenic effects in cultured human endothelial cells through KIT-dependent signaling 
pathways. In a mouse model of myocardial infarction, METRNL promoted infarct repair by selectively 
expanding the KIT-expressing endothelial cell population in the infarct border zone. Metrni-deficient mice 
failed to mount this KIT-dependent angiogenic response and developed severe postinfarction heart 
failure. Our data establish METRNL as a KIT receptor ligand in the context of ischemic tissue repair. 


cute myocardial infarction (MI) is a 

common cardiac emergency triggered 

by sudden coronary artery thrombosis 

and occlusion (J). The adult mammalian 

heart has limited regenerative capacity 
(2). Ischemic tissue injury sustained during 
acute MI therefore leads to scar formation and 
may result in left ventricular (LV) chamber re- 
modeling and heart failure (3). 

Effective tissue repair after MI involves a 
vigorous angiogenic response that commences 
in the infarct border zone and extends into 
the necrotic infarct core. Neovessel formation 
after MI mitigates scarring and worsening of 
heart function and may represent a therapeu- 
tic target (4). Monocytes (Mos) and macro- 
phages (Mphs) accumulating in the infarct 
region drive postinfarction angiogenesis (5) 
by secreting proteins that impart signals to 
nearby endothelial cells (ECs) expressing their 
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cognate receptors (6, 7). The full complexity of 
this intercellular cross-talk that shapes angio- 
genesis and functional adaptation after MI re- 
mains incompletely understood (8, 9). 

We conducted a bioinformatic secretome 
analysis in a mouse model of acute MI to dis- 
cover previously uncharacterized myeloid cell- 
derived growth factors that drive infarct repair 
(fig. S1, A and B). We thus identified the 
30-kDa protein METRNL (meteorin-like) as 
being strongly expressed by myeloid cells in 
the infarct region of the left ventricle (table S1). 
METRNL is known to be secreted by Mphs 
during inflammation (J0-12) and to promote 
metabolic adaptation and tissue protection 
under stressful conditions (12-74). The METRNL 
receptor is unknown. 

METRNL was weakly expressed in the heart 
under sham-operated baseline conditions but 
strongly induced after MI (fig. S2A). METRNL 
was also abundantly expressed in myocardial 
tissue specimens from patients with acute MI 
(fig. S2B). As shown by confocal microscopy, 
METRNL-expressing cells coexpressed the 
myeloid cell marker CD11b and were often 
located in the vicinity of ECs in the infarct 
border zone (Fig. 1A). Quantitative reverse 
transcription polymerase chain reaction anal- 
ysis identified Mos and Mphs as the main 
Metrnil mRNA-expressing cell types in the 
infarct region, bone marrow, spleen, and pe- 
ripheral blood (fig. S3A). Metrnl was broadly 
expressed in Mo and Mph clusters defined 
by single-cell RNA sequencing in the infarcted 
heart (fig. S3B). Delineating Mo and Mph sub- 
sets on the basis of chemokine (C-C motif) 
receptor 2 (CCR2) and chemokine (C-X3-C 
motif) receptor 1 (CX3CRI1) expression (15), we 
found Metrni to be more strongly expressed in 
CCR2"*" Mos and CCR2"2" CX3CRIMBH/1Ow 
Mphs than in CCR2!°Y Cx3CcR1™2" Mphs 
(fig. S3C). 
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We subjected Metrni-deficient (Metrnl ’ / ~) 
mice to acute MI to explore METRNL’s func- 
tion during infarct repair. Metrnil~ mice 
breed normally and display no apparent phe- 
notype (/1, 12). Indeed, Metrni~ mice had 
anormal heart size and preserved LV func- 
tion under sham-operated baseline conditions 
(table $2). After MI, however, Metrni”~ mice 
developed larger infarct scars than their wild- 
type (WT) littermates (fig. S4A), with more- 
pronounced LV dilatation and contractile 
dysfunction, as demonstrated by microcathe- 
terization (fig. S4B and table S2) and high- 
resolution echocardiography (fig. S4, C and D). 
This deleterious response was not related to 
differences in infarct size or cardiomyocyte 
apoptosis in the infarct border zone 24 hours 
after injury (fig. $4, E and F) or to altered 
inflammatory cell accumulation in the infarct 
region (fig. S4G). Yet new capillary formation 
in the infarct border zone was impaired in 
Metrni~ mice (Fig. 1B and fig. S4H). Myocar- 
dial capillary density under baseline con- 
ditions (Fig. 1B) or remote from the infarct 
region (fig. S41) was not reduced in Metrnl ’ /- 
mice, indicating that Metrni deletion spe- 
cifically impaired angiogenesis after ische- 
mic injury. 

Transplanting Metrnl : '“ mice with WT bone 
marrow cells reversed the angiogenic defect 
(Fig. 1C) and ameliorated LV scarring and 
remodeling after MI (fig. S5, A to C). Con- 
versely, transplanting WT mice with Metrni- 
bone marrow cells or deleting Metrni selec- 
tively in myeloid cells by crossing Metrnt’” 
mice with LysM“’”* mice (fig. S6, A and B) 
impaired postinfarction angiogenesis (Fig. 1, C 
and D) and worsened LV scarring and remod- 
eling (figs. S5, A to C, and S6, C to E). Together, 
these findings show that myeloid cell-derived 
METRNL promotes angiogenesis, tissue repair, 
and functional adaptation after MI. 

METRNL dose-dependently stimulated hu- 
man coronary artery endothelial cell (HCAEC) 
proliferation and migration after scratch in- 
jury (fig. S7, A and B), indicating that the 
protein directly acts on ECs. The sigmoidal 
dose-response relationships and METRNL’s 
low half-maximal effective concentration 
(<1 ng/ml) suggested that METRNL signals 
through a high-affinity receptor. Using chem- 
ical cross-linking mass spectrometry, we iden- 
tified KIT (KIT receptor tyrosine kinase) as a 
METRNL cell surface receptor candidate in 
ECs (fig. S8, A and B). KIT is the receptor for 
stem cell factor (SCF; also known as KIT ligand). 
KIT is expressed by various cell types, includ- 
ing hematopoietic stem and progenitor cells 
and germ cells. SCF signaling via KIT plays 
multifaceted roles during development and 
is important for normal hematopoiesis and 
fertility (76). Most KIT-expressing cells in the 
adult mouse myocardium are ECs that retain 
their endothelial identity after MI (17-20). 


lof5 


RESEARCH | REPORT 


A |, METRNL IB4 DAPI ——  B 
WT § Metrnr- 


METRNL DAPI CD11b DAPI Merge Cc 
WT bs 


CD31* cells (per CM) 


Fig. 1. Myeloid cell-derived METRNL promotes angiogenesis after 
myocardial infarction. (A) Confocal immunofluorescence microscopy 
images taken from the infarct border zone 3 days after MI in WT or Metrni-”— 
mice. Sections were stained with 4',6-diamidino-2-phenylindole (DAPI), 
fluorescein-labeled isolectin B4 (IB4), and antibodies against METRNL 

and CD11b. Of all METRNL-expressing cells, 92 + 5% coexpressed CD11b 
(data from four mice). Scale bars, 25 um. (B) WT and Metrnl’”~ mice 
underwent sham or MI surgery and were followed for 6 days (sham) or 

up to 28 days (MI). Fluorescent images depict CD31* ECs in the infarct border 
zone on day 28. Extracellular matrix and cardiomyocyte (CM) borders 

are highlighted by wheat germ agglutinin (WGA) staining. Scale bar, 50 ym. 
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Fig. 2. METRNL binds to the extracellular domain of KIT. (A) Conditioned 

supernatants (SN) were prepared from HEK-293 cells transfected with expression 
plasmids encoding human KIT-ECD-Fc, METRNL, SCF, and/or VEGFA. Pull-down of 
KIT-ECD-Fc using protein G-coated beads copurified METRNL and SCF. Exemplary 
results from three experiments are shown. M,, relative molecular mass. (B) Protein- 
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Summary data are from five mice per group. One-way analysis of variance 
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OVA) was performed with Dunnett test (MI versus same genotype sham), 


as well as independent-samples t test (WT versus Metrni’’-). (C) Bone 
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row cells from Metrni-”~ [knockout (KO)] or WT mice were transplanted 


into (—) lethally irradiated KO or WT recipients. MI was induced after 
bone marrow reconstitution, and CD31" capillary density in the infarct border 
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e was determined on day 28. Independent-samples t test was performed. 
CD31* capillary density in the infarct border zone in Metrni" mice 


and Metrni" LysM°°“* (Metrni™" Cre) mice on day 28. Two-way ANOVA 
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h Tukey test was performed. (B to D) Data are means + SEM. *P < 0.05, 


**P < 0.01, ***P < 0.001. 
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protein interaction analysis by microscale thermophoresis. Binding curves illustrate 
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interaction of human METRNL and SCF with human KIT-ECD-Fc. Human 


VEGFA did not interact with KIT-ECD-Fc, nor did METRNL with an IgG1-Fe control 
protein. Data are means + SEM from three or four technical replicates; two 
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itional experiments yielded similar results. Ky, dissociation constant. 
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A SCF 5 min O SCF 15 min phosphoproteome datasets from HCAECs stimulated with METRNL, SCF, or VEGFA (four technical 
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Coimmunoprecipitation experiments in hu- 
man embryonic kidney 293 (HEK-293) cells 
expressing METRNL and KIT confirmed the 
physical interaction between the two proteins 
(fig. S9, A and B). After incubating human 
umbilical vein EC lysates containing endoge- 
nous KIT (27) with METRNL or SCF, we co- 
purified both ligands by immunoprecipitating 
KIT (fig. S9C). To assess whether METRNL 
binds to KIT’s extracellular domain, we prepared 
conditioned supernatants from HEK-293 cells 
expressing METRNL, SCF, vascular endothelial 
growth factor A (VEGFA), and/or a secreted 
KIT-extracellular domain-Fc fragment fusion 
protein (KIT-ECD-Fc). Pulling down KIT-ECD-Fe 
from the supernatants, we copurified METRNL 
and SCF, but not VEGFA (Fig. 2A). Microscale 
thermophoresis showed that METRNL and 
SCF bind to KIT’s extracellular domain with 
similarly high affinity (Fig. 2B and fig. S9D). 

We next explored whether METRNL medi- 
ates its angiogenic effects via KIT. In line with 
previous studies linking endothelial KIT ac- 
tivation to angiogenesis (2/7, 22), SCF stimu- 
lated HCAEC proliferation and migration 
after scratch injury (fig. S7, C and D), as we 
had observed with METRNL (fig. $7, A and 
B). Small interfering RNA-mediated down- 
regulation of KIT curtailed METRNL and 
SCF’s angiogenic effects but did not affect 
EC responses to VEGFA (fig. $7, E and F). 

SCF stimulation induces KIT homodimeri- 
zation and transphosphorylation of tyrosine 
residues 568 and 570 (Y568/Y570) in the 
juxtamembrane region of the receptor (23), 
thus creating a docking site for SRC family 
kinases (24). Treating HCAECs with METRNL 
or SCF induced KIT (Y568/Y570) phosphoryl- 
ation with similar kinetics (fig. S1OA). Add- 
ing recombinant KIT-ECD-Fc to the culture 
medium prevented METRNL and SCF from 
inducing KIT (Y568/Y570) phosphorylation 
(Fig. 3A), consistent with the idea that trans- 
membrane KIT and KIT-ECD-Fc compete for 
METRNL and SCF binding. Indeed, recombi- 
nant KIT-ECD-Fc dose-dependently inhibited 
the angiogenic effects of METRNL and SCF, 
but not those of VEGFA (Fig. 3B). 

Downstream of KIT, both METRNL and 
SCF induced SRC (¥416) and AKT1 (S473) phos- 
phorylation, again following comparable sig- 
naling kinetic trajectories (fig. S10, B and C). 
To gain a broader perspective, we applied high- 
resolution mass spectrometry to examine pro- 
tein phosphorylation dynamics in HCAECs 
stimulated with METRNL, SCF, or VEGFA. 
In a principal components analysis based on 
4750 phosphosites distributed among 1891 dif- 
ferent proteins, the phosphoproteome signa- 
tures of METRNL and SCF clustered together 
and were segregated from VEGFA’s signatures 
in principal component space (Fig. 3C). Un- 
supervised hierarchical clustering and plotting 
the data in a correlation matrix confirmed that 
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the phosphoproteomic responses to METRNL 
and SCF were closely related and distinct from 
the response to VEGFA (fig. S11, A and B). The 
angiogenic effects of METRNL and SCF are 
thus mediated through KIT and shared down- 
stream signaling pathways. 

We used single-cell RNA sequencing to iden- 
tify Kit-expressing ECs in the adult mouse myo- 
cardium. Whereas few ECs, scattered across 
several EC clusters, expressed Kit under base- 
line conditions, the number of Kit-expressing 
ECs markedly increased after MI (Fig. 4A). 
Kit was the only gene that was differentially 
expressed (logs ratio > 1, P < 0.05) in Kit- 
expressing versus Kit-nonexpressing ECs 
under baseline conditions. After MI, however, 
genes related to cell proliferation and migra- 
tion were highly enriched in Kit-expressing 
ECs, indicating that the cells had become ac- 
tivated (table S3). 

To explore whether KIT expression deter- 
mines EC susceptibility to METRNL and SCF 
stimulation, we isolated cardiac ECs from 
c-kit??? eTomate/+ knock-in mice. The nuclear 
tdTomato signal in these mice faithfully reca- 
pitulates the expression pattern of endogenous 
KIT (/7). Using click chemistry to fluorescently 
label EdU* proliferating ECs, we found that 
METRNL and SCF stimulated c-kit??? 70. 
positive but not c-kit??? 470" _n egative 
EC proliferation (fig. S12). To survey KIT- 
expressing EC behavior in the infarct border 
zone, we subjected c-kit??? @70"“°/* mice to 
acute MI. As we had observed by single-cell 
RNA sequencing (Fig. 4A), c-Rit??2 te7omato_ 
negative ECs outnumbered c-kit#?2 tome. 
positive ECs under baseline conditions and 
after MI (Fig. 4B). Yet expansion of both 
populations contributed about equally to the 
increase in capillary density in the infarct bord- 
er zone (Fig. 4B). Treating c-hit??? 70mato/+ 
mice with KIT-ECD-Fc (100 ug into the LV 
cavity immediately after MI) selectively pre- 
vented c-kit? _yositive EC expansion in 
the infarct border zone (Fig. 4B) and worsened 
LV dysfunction after MI (table S4), indicat- 
ing that growth factor(s) signaling through 
KIT stimulate KIT-expressing EC expansion 
and help preserve heart function after MI. 

Unlike METRNL, SCF was expressed by 
ECs, fibroblasts, and cardiomyocytes in the 
infarct region (fig. S13A), and its expression 
did not increase after MI (fig. S13B). Further, 
we observed no up-regulation of SCF in the 
infarct region of Metrni’~ mice (fig. S13C). 
To explore whether METRNL drives KIT- 
expressing EC expansion after MI, we crossed 
the c-kit??? 7omao allele into Metrnl~ mice. 
Metrnl deletion did not affect c-kit??? 70". 
positive or -negative EC densities in the myo- 
cardium under baseline conditions (Fig. 4C), 
indicating that METRNL is not necessary for 
establishing either cell population during de- 


velopment. After MI, however, c-hit??2 Tomato 
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positive EC numbers failed to increase in 
MetrnI~ mice, whereas c-kit???o (tome. 
negative EC expansion was not affected (Fig. 4C). 

To explore METRNL’s therapeutic potential, 
we treated heart failure-prone FVB/N mice 
(25) with METRNL after MI (bolus injection 
into the LV cavity immediately after MI, fol- 
lowed by subcutaneous infusion) (fig. S14A). 
METRNL treatment enhanced infarct border 
zone capillarization (fig. S14B), limited scar 
formation (fig. S14C), and exerted sustained 
beneficial effects on LV function (table $5). In 
c-kit’? 2 aTomato/*+ reporter mice, these ther- 
apeutic effects were associated with selective 
expansion of c-kit??? 70™@°_positive ECs in 
the infarct border zone (fig. S14D), showing 
that endogenous METRNL’s effects on angi- 
ogenesis and heart function after MI can be 
therapeutically enhanced. METRNL therapy 
did not affect infarct size and cardiomyocyte 
apoptosis in the infarct border zone 24 hours 
after injury (fig. $14, E and F) or cardiomyocyte 
cell cycle activity in the left ventricle during 
the first week after MI (fig. S14, G and H). 
Notably, treating c-kit'?270"-/* mice with 
SCF expanded the c-kit??? 470" positive EC 
population in the infarct border zone and im- 
proved systolic function, as did treating the 
mice with METRNI (fig. S14, I and J). Increas- 
ing the availability of either ligand therefore 
promotes similar beneficial effects after MI. 

In summary, we define a METRNL-based 
cross-talk between myeloid cells and ECs that 
promotes tissue repair after MI. We identify 
KIT as the cell surface receptor that mediates 
METRNL's angiogenic effects. While METRNL 
and SCF both bind to KIT with high affinity, 
engendering concordant phosphoproteome sig- 
natures in cultured human ECs, the two ligands 
exert nonredundant functions during devel- 
opment and after injury. In contrast to Scf- 
deficient mice, which die of severe anemia 
around the time of birth (26), Metrni-deficient 
mice develop normally with no overt hemato- 
logical abnormalities (fig. S15 and table S6). 
After MI, myeloid cell-derived METRNL drives 
expansion of the KIT-expressing EC population 
in the infarct border zone. This KIT-dependent 
angiogenic response is severely blunted in 
Metrni-deficient mice, indicating a limited role 
for SCF in this setting, a conclusion in line with 
SCF’s persistently low cardiac expression after 
MI (fig. S13) (27). Myeloid cell production 
of METRNL may thus provide a mechanism 
for activating endothelial KIT in the context of 
tissue injury and inflammation. Further studies 
may reveal other cell types and pathological 
conditions in which METRNL-KIT signaling 
is important. 
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By llya Smolensky 
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More than my hands 


was sitting in a chemistry lab class during my first year of university, nervous about the experiment 
we were to perform. The procedure required us to transfer a solution from one tube to another. 
I grabbed a pipette and, as I feared, my hand started to shake—a symptom of cerebral palsy. I asked 
my lab partner whether they could perform the task instead. The experience was disheartening: I 
was hoping to pursue a career in science, but I started to wonder whether that would be possible if 
I had trouble completing simple lab experiments. I thought my dreams had crashed to the ground. 


I was born after my twin brother. His 
birth was free of complications, but I 
was strangled by an umbilical cord 
and born with severe hypoxia. After 
a few weeks in the neonatal inten- 
sive care unit, I was diagnosed with 
cerebral palsy. My family managed 
to find good doctors where we lived, 
in Leningrad (now St. Petersburg), 
Russia, and I took part in clinical tri- 
als testing new interventions. Shortly 
after my first birthday, I started 
walking and it became clear my 
cognitive functions were unaffected. 
So in some sense I was lucky. 

I couldn’t do some things grow- 
ing up. My left hand was (and still 
is) much worse than my right—I 
was mostly limited to using two 
fingers as pincers. I learned to 
write and do simple tasks with my 
right hand, but it wasn’t dexterous 
enough to do anything that required precision. Both hands 
also shook, especially when I was nervous or embarrassed. 

As a teenager, I faced a lot of bullying at school. Feeling 
alone, I joined a study group focused on the natural world, 
thinking I would like to live far from any people, studying 
animals. That’s how I came into the field of biology. 

At university, I enjoyed the lectures in my science classes. 
Many lab tasks proved impossible, however. Once, during a 
field course on an island in the White Sea, in the far north of 
Russia, I struggled to put a small marine worm in a petri dish. 
I spent that night walking around in the woods, wondering 
what else I could do other than biology. 

As I struggled with my mood, I read a book about depres- 
sion. From then on, the physiology of mental disorders be- 
came my scientific passion. I looked into what was being done 
locally and was excited to discover a lab that did behavioral 
experiments in rats to study depression. I thought that even 
with my shaking hands, I was capable of putting a rat into a 
maze and observing its behavior. I wanted to join that lab. 

At the end of my second year, I approached the professor 
to see whether I could work with her. I was afraid to admit 


“There’s much more to being 
a scientist than performing 
the physical labor.” 


I couldn’t do some lab tasks. To my 
relief, she was completely support- 
ive. “Don’t worry, we will find work 
to occupy you,” she said. 

She set me to work performing 
behavioral experiments for others in 
the lab. Later, when the time came 
for me to produce a senior thesis, I 
used data I collected and hormonal 
data collected with the help of a col- 
league who completed the pipetting 
steps. I loved the supportive atmo- 
sphere so I stayed there to complete 
my master’s and Ph.D. 

When I graduated, I felt an itch to 
focus on molecular mechanisms of 
mental disorders. I landed a position 
that involved computer modeling, 
but I quit after 8 months because 
writing computer code was both 
frustrating and boring for me. After 
that I switched to a lab studying epi- 
lepsy in rats. I could do the work, but it wasn’t my passion. 

Eventually I started to look for positions abroad, buoyed 
by advice a doctor once gave me: “If you are smart enough, 
you will be a scientist regardless of your hands.” During an 
interview for a postdoc in Switzerland, I revealed that I have 
cerebral palsy and can only do behavioral experiments. A few 
days later I received a job offer, which included a detail that 
the lab would hire a technician for any delicate tasks associ- 
ated with my project. 

I’ve been in that lab now for 2 years, and I’ve come to real- 
ize that my hands aren’t the impediment I thought they were. 
By finding the right people, being open about my abilities, 
and working as part of a team, I’ve been able to follow my 
passions. I’ve also realized that there’s much more to being a 
scientist than performing the physical labor. I may not collect 
all the data in my papers, but I’m fully capable of devising ex- 
periments and interpreting results—which, to me, is the most 
exciting part of science. 


llya Smolensky is a postdoc at the University of Fribourg and the 
University of Basel. Send your career story to SciCareerEditor@aaas.org. 
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